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THE DESCRIPTION OF FACIAL EXPRESSIONS 


IN 


TERMS OF TWO DIMENSIONS 


HAROLD SCHLOSBERG 


Brown University 


Early studies on facial expression 
stressed the inaccuracy of judgments. 
In 1938 Woodworth suggested that 
the inaccuracy was more apparent 
than real, in that it resulted from scor- 
ing judgments on a simple right-wrong 
basis. Through an analysis of pub- 
lished results from 
developed a scale, 
following six steps: (1) Love, Happi- 
ness, Mirth; (2) Surprise; (3) Fear, 
Suffering; (4) Anger, Determination; 
(5) Disgust; (6) Contempt. There 
was also a seventh category, Scatter- 
ing, for pictures which didn’t fit into 
the other six. 


earlier studies, he 
consisting of the 


The use of such a scale 
showed clearly that Ss made many 
minor errors, but few major ones. 
That is, they would frequently trans- 
pose a picture from Step 3 to Step 4, 
but they were less apt to misplace it in 
more remote steps, as I or 6. 
Schlosberg (6) tested this scale on 
a new series of pictures, those of Frois- 
Wittmann (1), published by Hulin and 
Katz (3). The pictures were sorted 
into a linear series of bins, labeled in 
terms of the Woodworth categories. 
It was immediately obvious that pic- 
tures whose modes fell in Step 6 were 
as apt to spread over into Step 1 as 
into Step 5. This could only mean 


2 


that the scale was recurrent, rather 
than linear. Hence, the computations 
of mean scale positions and average 
deviations were based on the assump- 
tion that the scale was circular; that 
is, a picture which was evenly split 
into Steps 6 and 1 was assigned a mean 
of 63, rather than of 33, which would 
have put it on the opposite side of the 
circle. 

To check on the possibility that 
these results were peculiar to one set 
of pictures, the method was applied 
to two other series. In an unpub- 
lished Honors project in 1943, Miss 
Marjory L. Brown found that the 
Ruckmick (5) series showed clear evi- 
dence of conforming to a circular scale, 
with five of the 32 pictures having 
some overlap between Steps 6 and 1. 
Another series was also tried, made up 
of supposedly unposed pictures. Fif- 
teen were from the Munn (4) series, 
and 17 additional ones were selected 
from Life magazine. Only one of this 
series spread across the “‘ends”’ of the 
scale, but this partial failure to show 
circularity may have been due to the 
selection of pictures. There was only 
one with a mode in Step 6 (Contempt) 
and the only four pictures in Step | 
showed very little spread to any other 
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categories. Hanawalt (2) has also 
reported a failure to find evidence of a 
circular scale in facial expressions, but 
this finding was only an incidental one 
in an experiment designed for other 
purposes. 

The question as to whether or not 
the Woodworth scale of facial expres- 
sions constitutes a circular or recur- 
rent series is really only a preliminary 
one. Let us assume for the moment 
that the scale describes the periphery 
of a roughly circular surface on which 
all facial expressions can be located, 
and examine the consequences. The 
circular surface may be compared with 
the color circle, with the six scale steps 
corresponding to the major hues. 
This comparison brings out the fact 
that the facial expression surface 
should have a neutral point (“gray’’) 
in the center; the strength of an ex- 
pression would correspond roughly to 
the saturation of a color. Still more 
important, one should be able to 
describe the surface in terms of two 
axes, just as the color surface can be 
described in terms of blue-yellow and 
red-green axes. Examination of the 
results of the 1941 experiment (6) led 
the writer to suggest that these two 
axes were pleasantness-unpleasantness 
and attention-rejection. The first axis 
needs no further explanation, but the 
second one does. Attention is exem- 
plified by surprise, in which all recep- 
tors are maximally open tostimulation. 
Rejection is the best term we have 
found for the other end of this axis; 
it is shown most clearly in contempt 
and disgust, in which eyes and nostrils 
appear to be actively shutting out 
stimulation (Fig. 2). 

This brings us to the really impor- 
tant question—can typical series of 
facial expressions be adequately de- 
scribed in terms of two axes? If we 
can answer the question in the affirm- 
ative, a very complex field will have 


been reduced to relatively simple 
dimensions. One way to test this 
hypothesis is to obtain ratings of the 
locations of each of a series of pictures 
on the P-U and A-R axes, and then 
to use this joint position to predict 
the Woodworth scale value of each 
picture. The present paper reports 
four separate studies. We may antici- 
pate the results by pointing out that 
three of the studies, using two differ- 
ent series of pictures, yielded corre- 
lations above .90 between obtained 
scale values and those predicted from 
independent ratings of P-U and A-R. 


PROCEDURE 


All four experiments employed the same basic 
method. The S sorted a set of pictures along a 
9-point rating scale ranging from maximum un- 
pleasantness (1) to pleasantness (9). While £ 
was tabulating these ratings, S was given dupli- 
cate prints of the same pictures to sort on a scale 
rejection (1) to attention (9). Ratings from all 
Ss were combined by simple averaging to give a 
single pair of values for each picture. The pic- 
tures were then plotted on a sheet of graph paper, 
using P-U values on the ordinate and A-R values 
on the base line. This scatter plot was then 
mounted on a large 360° protractor, with the 
center of the circle at the mid-point (5-5) of the 
axes, and the P-U axis oriented at 60° and 240°, 
corresponding to scale positions 1.00 and 4.00, 
as we will see in a moment. A thread stretched 
from the center, across a plotted point and out to 
the periphery, permitted a reading for that pic- 
ture in degrees, referred to the circumference of 
the circular surface. The degree reading could 
be converted to predicted positions on the Wood- 
worth circular scale by dividing by 60, since there 
were six steps on the Woodworth scale. These 
predicted values were then compared with those 
actually obtained by independent sorting made 
in the Woodworth bins.! 


1 If the method of predicting Woodworth scale 
position from rating data is not clear from this 
brief description, turn to Fig. 1, under Results. 
In the Discussion, it will be shown that this 
method unavoidably discards some information. 
The location of a picture on a surface described 
by two axes can give (a) the distance the picture 
lies from an arbitrary center, and (b) the radius 
on which it falls. Since the Woodworth scale is 
simply the circumference of the circular surface, 
we can utilize only (5), above, in making the 
predictions. 
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Prepicrep AND Ostainep ScALe VALUES FOR THE RucKMick PicTUREs 





6-Step Scale Position 
































} 
vies | 1943 Scale Data 
| icture 1) A | 
Description | Number | ' . BPE | Pa | ite | 
| | Predicted Obtained | Error Position | Av. Dev 
(1) (2) (3) | (4) | (5S) (6) (7) ; (8) 
Reconciliation 28 | 8.05 7.15 1.60 0.90 0.70 | 0.92 0.19 
Interested Inquiry 30 7.90 7.10 1.62 0.92 70 | 1.01 05 
Amusement 29 8.20 | 7.25 | 1.60 | 1.00 60 | 1.00 00 
Mirth 31 8.80 8.15 | 1.67 100 | 67 1.00 00 
Joy 32 8.80 8.25 | 1.68 100 | 68 1.00 00 
Adoration 17 6.60 6.25 | 1.65 1.05 60 | 1.20 29 
Wistful Appeal 26 6.95 6.75 | 1.70 1.22 48 1.10 20 
Cynical Interest 19 6.70 | 640 | 1.65 1.23 42 1.41 41 
Interested Observation | 27 7.20 6.85 | 1.67 1.27 40 1.28 45 
SASSO |——_ | ee | j——— ———— - 
Astonishment 3 4.60 5.95 | 2.87 2.05 82 2.10 28 
Consternation 24 3.85 7.10 2.98 2.05 93 2.10 30 
Pleading 20 5.25 5.70 | 2.17 2.08 09 2.50 87 
Apprehension 2 2.25 6.10 | 3.65 | 2.70 95 2.77 59 
Startled Fear 3 1.75 5.30 3.92 | 2.72 1.20 2.67 50 
Dread 7 2.05 5.75 | 3.77 | 2.85 92 3.00 30 
Anguish 8 2.05 5.10 | 3.97 | 2.97 1.00 3.00 08 
Resentment 14 3.30 | 490 | 4.09 | 3.13 6 3.45 75 
Interested Attention 18 3.25 6.45 | 3.35 3.15 .20 2.59 1.06 
Sorrow 5 4.65 5.50 | 3.12 3.19 — .07 2.85 40 
Exhausting Pain 6 2.40 3.15 4.60 3.47 | 1.13 3.75 1.13 
Anxiety 21 3.40 4.95 4.05 3.63 42 3.28 §2 
Fear 4 1.45 5.90 3.77 4.00 .23 3.93 54 
Anger | 2.00 4.60 4.13 | 4.17 — .04 4.13 38 
Distrust 12 2.65 4.20 4.32 4.37 — 05 4.65 69 
Haughtiness 15 4.35 3.75 5.05 4.52 53 4.72 85 
Contemplation 10 3.70 3.75 4.75 4.62 13 4.36 5 
Defiance 16 3.10 3.80 4.55 4.73 —.18 4.50 77 
Critical Distrust ll 3.50 3.45 4.79 4.87 — .O8 5.14 79 
Rueful Meditation 22 4.55 3.70 5.19 5.27 — .O8 4.74 91 
Scorn 13 3.60 2.25 5.05 5.58 — .53 5.68 51 
Sulkiness 9 4.00 2.60 5.13 5.68 —.55 3.50 80 
Contempt 25 6.00 5.55 7.47 6.40 1.07 6.10 33 
Mean 4.59 5.42 (r = .96) 54 


* The descriptions are slightly condensed from the “Adopted Description’ column of Ruckmick’s leaflet 
which is furnished with the set of photographs by C. H. Stoelting Co., Chicago, Ill. The numbers used in this 
study are those printed on the backs of the photographs; they do not correspond to those given in Ruckmick’s 
— paper (5). The pictures are listed in order of Woodworth’s scale position (Col. 5), with breaks to separate 
the six steps. 





In Exp. I, the Frois-Wittmann pictures were 
used. The P-U and A-R sorting was done in 
piles on a series of nine numbered cards. Little 
trouble was found in getting S to understand the 
P-U axis; it was merely necessary to stress the 
fact that he was to judge whether “The man felt 
pleasantness or unpleasantness.” The A-R axis 
caused more trouble. An effort was made to 
describe attention as openness to stimulation and 
to point out that the eyes, nostrils, and mouth 
tended to be constricted in rejection. 

In Exp. II, a more effective way was found 


for describing the rating scales. After the usual 
explanation of A-R, picture No. 23 of the 
Ruckmick series was shown as a typical example 
After S 
had placed these pictures at the appropriate 
positions on the scale, these “anchors” were 
reclaimed, and he was given the pack of Frois- 
Wittmann pictures to sort. 


of attention, and No. 13 as rejection. 


The S was allowed 

to review the anchors if he wished to do so. 
The results of Exp. II were so encouraging 

that it seemed desirable to duplicate this experi- 
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ment asacheck. Experiment III was run a year 
later, with only minor changes over II. Two 
wooden racks of bins were used to facilitate 
handling and tabulating the pictures, and fresh 
prints of the pictures were used. 

Experiment IV was planned to check the 
results of II and III on another series, that of 
Ruckmick, which consisted of 32 pictures of a 
female face. Both the P-U and A-R scales were 


cD 









ro 


r8 


+7 


anchored by showing pictures from the Frois 
Wittmann series, each end being exemplified by 
six pictures. In this experiment, Ss sorted the 
pictures into the Woodworth scale bins both 
before and after the rating, using the same rack 
and method employed earlier (6). This was 
done largely to build up a new set of scale values 
for this series, which could be compared with 
those obtained approximately a decade earlier. 
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Fic. 1. The location of each of the 72 Frois-Wittmann pictures on an oval surface. 
No. 10 (encircled) serves as an example of the method used. It is plotted at axis values of P-U = 7, 
A thread is then stretched from the intersection of 
the axes (5-5) across the plotted point, and its position read off in degrees. 


A-R = 7, as determined by the rating scales. 


the predicted Woodworth scale value (1.75) which is compared with that obtained earlier by direct 
sorting on the Woodworth scale (1.65). This picture might be called, “Pleased Surprise,” and 
both predicted and obtained values place it in the appropriate step of the Woodworth scale. 
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TABLE 2 

CoRRELATIONS AND AVERAGE ERRORS BETWEEN PREDICTED AND OsTaAINEeD ScALE VALUES 
Exp. Pictures Used Type of Anchor | Ss ? \verage/ Error 
a — — —— —————— a ee ae an | eee — —E —E 

I Frois-Wittmann Verbal 8 .76 not avail. 

Il Frois-Wittmann Samples 9 94 not avail. 
iil Frois-Wittmann Samples 18 92 a ae 
IV Ruckmick Samples 20 .96 s. 


| 


These average errors were obtained against an adjusted scale. 


(Experiments I-III used the Schlosberg scale 
values for the Frois-Wittmann pictures.) 

All experiments were conducted by under- 
graduates as special projects in the writer’s labor- 
atory course.2 The students made their own 
analyses of the data, but the writer has verified 
and extended all computations in Exp. III and 
[V. All Ss in the first three experiments were 
members of the same course, and were approxi- 
mately equally divided as to sex. There were 
eight Ss in Exp. I and nine in II. In Exp. III, 
there were 18, but two Ss repeated the experi- 
ment, giving 20 sets of data. Of the 20 Ss in 
Exp. IV, 8 were from the laboratory course, and 
the remaining 12 were nonstudents, ranging in 
age from 20 to 55 yr. No S participated in 
more than one experiment. 


RESULTS 


Table | presents the detailed results 
from the Ruckmick series in Exp. IV.* 
For comparison with the present data, 
the unpublished results obtained in 
1943 by Miss Brown are also pre- 
sented. The older scale positions were 
based on three sortings by each of 23 
Ss. The method of deriving the pre- 
dicted scale position from P-U and 
A-R ratings is given in the Procedure, 
and is illustrated in Fig. 1. 


2 T wish to express my thanks to the students 
who served as Es in the following experiments: 
I—Robert F’. Shepard, Jr.; 1I—Mark T. Sheehan; 
I1I—Louise Anthony, Jean M. Coraci, Sheila K. 
Hart, Ann M. Thomas; IV—Shirley A. Juskalian, 
John J. Kennedy. Dr. W. S. Hulin kindly sup- 
plied several sets of the Frois-Wittmann pictures, 
which speeded up the collection of data. 

3 The data on individual pictures of Exp. III 
will not be tabulated, since the ratings and pre- 
dicted scale positions can be determined from 
Fig. 1 and the obtained scale positions are in 
print (6). 


Table 2 summarizes the results of 
all four experiments together with 
key words to identify the methods. 
Notice that all three experiments 
which employed anchors yielded pre- 
dictions which correlated above .90 
with the scale positions obtained by 
sorting the pictures into the Wood- 
worth scale. One may question the 
precise significance of these correla- 
tions, for the scale values are not 
normally distributed. Further, the 
coefficient does not yield all the infor- 
mation that might be desired. 

A more revealing analysis can be 
made by examining the errors in pre- 
diction. Mean errors were computed 
for Exp. III and IV, and are included 
in Table 2. In both experiments, the 
average error of the prediction was 
about half a scale step; chance would 
give three times this error, for in a 
circular scale of six steps the maximum 
error is three steps, and the average 
error would be one and one-half steps. 


Discussion 


The correlations and the relatively 
small errors show that reasonably 
good predictions of Woodworth scale 
positions can be made from ratings on 
two axes, pleasantness-unpleasantness 
and attention-rejection. There is 
reason to believe that a large portion 
of what errors there are may be attrib- 
uted to the scale itself, and to certain 
working assumptions made in relating 
the scale to the axes. These assump- 
tions were (a) that the center of the 
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circular scale was at axis position 5—5, 
the mid-points of the two 9-point rat- 
ing scales, (b) that axis ends P, A, U, 
and R fell at circular scale positions 
1.0, 2.5, 4.0, and 5.5, respectively, and 
(c) that the six steps on the circular 
scale were equal. The results should 
enable us to refine these assumptions, 
and to improve the prediction. 


Examination of Table 1 will show that the 
second assumption is markedly in need of correc- 
tion for the Ruckmick series. Column 6 shows 
a marked preponderance of positive errors, in 
which the axis positions predicted scale values 
that were too high. The constant error is ap- 
proximately .43 scale steps; when most of this is 
removed by rotating the plotted surface counter- 
clockwise, putting P at .60 scale position, the 
average error drops from .54 to .42 scale steps. 
Simple rotation was not tried on the Frois- 
Wittmann results (Exp. III) for there was little 
indication of a constant error throughout the 
scale. 

Assumption (a), the locus of the mid-point of 
the circular scale, does not seem to be critical. 
Recalculations were made on both series of pic- 
tures after shifting the center to axis positions 
5-54; 4-5; and on the Ruckmick series, 44-4}. 
These shifts increased the average error slightly, 
but never beyond one of .55 scale points. 

Assumption (c), that the units are equal 
throughout the scale, is obviously only a gross 
approximation. In the Ruckmick series the 
variability of judgment, as shown by the average 
deviation, is about one-third as large in Step 1 
as in the other five steps. This suggests that 
Step 1 is three times as big as any of the other 
steps. ;As a way of testing the effects of a rough 
adjustment of the circular scale, a base line was 
marked off with the first step doubled in width, 
and the others cut to .8 units. Using predicted 
scale positions on the ordinate, the Ruckmick 
circular scale positions were plotted on this base 
line, and a line with a slope of one was fitted to 
the points by inspection. The deviations of the 
predictions from this line were then measured 
and averaged, yielding a mean error of .33 scale 
steps. This was by far the smallest average 
error of prediction obtained. A similar treat- 
ment of the Frois-Wittmann data, using a scale 
adjusted to conform to the average deviations of 
this series (6), gave an average error of .49 scale 
steps. Omitting one picture which cannot be 
adequately located on the circular scale (No. 43, 
see below), the average error becomes .46. 

An average error of prediction of .33 scale 
steps is probably close to the minimum. One 
limiting factor is the stability of the circular- 


scale positions we are trying to predict. In 
Table 1, two sets of such positions are given for 
the Ruckmick series, the set obtained in Exp. IV, 
and that obtained a decade earlier. The average 
difference between these two sets of values is .23. 
The reason for this difference is not hard to find; 
the scale forces all pictures out to the circum- 
ference, when some of them belong near the cen- 
ter of a circular surface. This is particularly 
clear in the case of picture No. 43, which appears 
at the center of both Fig. 1 and 2. It received 
an axis rating of 5.0 and 4.9, leading to a pre- 
dicted position of 5.50 0n the circular scale. But 
if 4 of the 20 Ss had increased their rating by one 
step on the A-R scale, thereby changing the 
picture’s mean rating to 5.1 on the other side 
of the center, the prediction would have been 
2.50. Thus the circular-scale position of this 
picture is largely a matter of chance. This 
showed clearly in the results of the 1941 experi- 
ment, for about one-third of the Ss put this pic- 
ture in the “Scattering” bin, and the rest dis- 
tributed it almost randomly in five of the six 
main categories. This picture is an extreme 
case, but there were others that showed large 
spread; for example, 8 of the 32 Ruckmick pic 
tures showed average deviations (Table 1, col. 8) 
greater than .75 scale steps. For such pictures 
it seems probable that the axis positions give a 
better description than do the circular-scale 
values. That is, a relatively neutral picture like 
No. 43 belongs not at the circumference of the 
circular surface, but near its center. 


Now that we know we are dealing 
with the whole surface rather than 
with its circumference, we can ask for 
a description of the surface. Like the 
“color circle,” it is only roughly cir- 
cular. Figure 2 probably gives a 
reasonable first approximation to its 
outline, for it includes all the extreme 
pictures. It was not practical to 
mount all the pictures in the more 
crowded areas, but their location is 
indicated by the scatter diagram repro- 
duced in Fig. 1. The surface seems 
to be roughly oval in form, with 
the P—-U axis longer than the A-R 
axis. ‘This agrees with the conclusion 
drawn in the writer’s 1941 paper, based 
on the distribution and average devi- 
ations of the pictures on the circular 
scale. It is difficult to determine the 
exact form of the surface, for there is 
no reason to believe that the distances 
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Fic. 2. The location of typical pictures of the Frois-Wittmann series. This display includ 
those pictures which are shown around the margin of the surface in Fig. 1, and as many of the mor 
centrally located ones as could be mounted. The average error of prediction for these 25 
is a half-scale step, just as it is for the whole series of 72 pictures. The picture with the worst pr 
diction of the whole series is No. 43, at the intersection of the axes (see text 
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are equal on the two axes. The pres- 
ent rating data are not extensive 
enough to warrant any elaborate scal- 
ing attempts. There is also the ques- 
tion as to whether or not the two axes 
are perpendicular; the difficulty some 
Ss experienced in distinguishing be- 
tween rejection and unpleasantness 
suggests that the included angle is 
somewhat less than 90°. Perhaps 
some of these relations might be 
worked out by comparison with the 
Hulin and Katz (3) data. They had 
their Ss sort the Frois-Wittmann pic- 
tures into piles, putting together any 
pictures that seemed similar. If our 
description of the circular surface is 
correct, the distance between any two 
pictures should vary inversely with 
the number of times the two pictures 
were put inthe same pile. But before 
this comparison is attempted, it would 
probably be advisable to obtain more 
precise ratings on P—U and A-R than 
our 20 Ss have furnished. 

One final point may be added—the 
writer is still not satisfied with the 
present method of obtaining ratings 
on the A-R axis. Inspection of the 
individual ratings assigned to the same 
picture shows that they occasionally 
range from 1 to9. The fact that the 
final predictions work as well as they 
do shows that these errors average 
out; there seems to be a genuine di- 
mension that shows up in spite of the 
difficulty with which it is described 
and judged. Perhaps the difficulty is 
due to the fact that our language has 
no good word for the “rejection” end 
of the axis; attention and inattention 
describe only half the axis. But cer- 
tainly the fact that common speech is 
not cognizant of the whole range of a 
dimension proves little about the util- 
ity of that dimension in analyzing 
facial expression, except only that the 
dimension may be difficult to handle 
experimentally or to describe in any 
but operational terms. 


The question was raised earlier as 
to whether or not the Woodworth 
scale was recurrent. The results of 
the foregoing analysis show clearly 
that the scale represents the circum- 
ference of an oval surface into which 
some pictures are forced by lack of a 
better place to put them. If a given 
series contains one or more pictures 
that lie somewhat above and slightly 
to the left of the center of the surface, 
as at axis position P—U 6, A-R 4.5, 
they will be projected to a circular- 
scale position of 6.5. This means that 
they will be judged in Step 6 (Con- 
tempt) half the time, and in Step 1 
(Love, Mirth, Happiness) the other 
half of the time. This spread across 
the ends of the scale happened in both 
the Frois-Wittmann and Ruckmick 
series, and furnished the original evi- 
dence for circularity. For example, 
the Frois-Wittmann picture No. 15 
fell at 6.24.2, resulting in a predicted 
value of 6.21, compared with an 
obtained value of 6.27. But the 
obtained value represents a mode of 
54 judgments in Step 6 with most of 
the spread, 50 judgments, to Step 1. 
The Frois-Wittmann series happens 
to be rich in pictures that fall in this 
area, which probably accounts for the 
fact that the circularity is so appar- 
ent. On the other hand, there hap- 
pened to be no picture in Miss Brown’s 
supposedly unposed series that could 
have been expected to spread across 
Steps6and 1. The pictures of Mirth 
showed no spread at all, and the only 
picture in Contempt was very close to 
Disgust, leaving a gap of a whole step 
at the crucial point. Parenthetically, 
it may be added that Contempt is a 
sophisticated expression, and is rather 
hard to judge. It is interesting that 
the oval surface has a place on which 
it fits so nicely. 

But to return to the circularity of 
the Woodworth scale. It was a for- 
tunate choice of pictures that led to 
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the original detection of the circular- 
ity, for there is no assurance that a 
given set of pictures will show the 
overlapping spread across the two sup- 
posed ends of the Woodworth scale. 
Failure to find such spread does not 
alter the more fundamental fact that 
all pictures of two different series, 72 
posed by a man and 32 by a woman, 
can be placed rather well on an oval 
surface described in terms of two 
axes. These axes are proposed as the 
basic dimensions in which facial ex- 
pressions may vary. The fine shades 
that are familiar in literary usage may 
well involve additional factors, as 
knowledge of the situation and of the 
past history of the individual. 


SUMMARY AND CONCLUSIONS 


1. In three independent experi- 
ments, using 8, 9, and 18 Ss respec- 
tively, ratings of the 72 Frois-Witt- 
mann pictures of facial expressions 
were obtained on two nine-point rat- 
ing scales: Pleasantness-Unpleasant- 
ness and Attention-Rejection. In a 
fourth experiment with 20 Ss, similar 
ratings were obtained on the 32 
Ruckmick pictures. 

2. Using these rating scales as rec- 
tangular axes, each picture was located 
on a roughly circular surface. 

3. By referring the plotted position 
of each picture to the circumference 
of a circle drawn around the surface, 
an attempt was made to predict the 
value each picture had received on a 


4 Many of our Ss reported that they tended to 
judge P-U on the basis of the mouth, and A-R 
from the eyes. This is consistent with Hana- 
walt’s (2) report that happiness is often confused 
with pain-suffering when judgments are based on 
only the upper half of unposed pictures. If the 
mouth is blotted out, S should have difficulty in 
distinguishing between P and U, the dimension 
which differentiates happiness from pain-suffer- 
ing. Careful study of confusions in half-face 
judgments might throw further light on the 
determiners of our two axes. 
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recurrent six-category scale of the 
Woodworth type. When the predic- 
tions were based on a poorly-anchored 
A-R scale (Exp. I), they yielded a 
correlation of .76 with obtained values. 
Anchoring the rating scales by sam- 
ples from another series of pictures 
(Exp. II, III, IV) raised the correla- 
tion to .94, 92, and .96, respectively. 

4. The average error of prediction 
was of the order of half a scale step, 
which is one-third the error expected 
on a random series on a six-step cir- 
cular scale. By adjustment of axes 
and scale steps, the error of prediction 
could be made to approach the limits 
set by the accuracy of the original 
scale judgments. 

5. It is concluded that two widely 
used series of facial expressions can 
be described very well by locating 
them on a roughly oval surface whose 
longer axis is pleasantness-unpleasant- 
ness, the shorter axis being attention- 
rejection. Recognition of finer shades 
of emotion may depend on knowledge 
of the stimulus situation. 


(Received February 25, 1952) 
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REMOTE ACTION POTENTIALS AT THE MOMENT 
OF RESPONSE IN A SIMPLE REACTION- 
TIME SITUATION? 


ROBERT L. HENDERSON 


University of Missouri 


Meyer (6) has reported an experi- 
ment designed to test Dunlap’s hy- 
pothesis that practice in a simple 
reaction-time (RT) situation will result 
in progressive decrease of muscular 
contractions in nonparticipating body 
parts at the moment of each reaction. 
Meyer’s results failed to provide any 
confirmation of this hypothesis, and he 
concluded that the presentation of the 
simple reaction-time task may give rise 
to either positive or negative incre- 
ments of amplitude of remote muscular 
contractions. While admitting that 
his data are influenced by a number 
of incompletely controlled variables, 
Meyer suggests that the Dunlap 
hypothesis requires revision (6, 7). 

Daniel (2) has subsequently pointed 
out certain other weaknesses in the 
design and execution of the Meyer 
study which leave the validity of the 
published results questionable. The 
present study represents an effort to 
test further Dunlap’s original hypoth- 
esis, using techniques and procedures 
similar to those used by Meyer, while 
trying to correct certain of his errors. 


EXPERIMENTAL DEsIGN 


Subjects.—The Ss were 20 undergraduate stu- 
dents in psychology at the University of Mis- 
souri. All Ss were obtained on a volunteer basis 
to insure adequate motivation, and were naive 
as to the purpose of the experiment. 

Action potential apparatus.—The Ss were iso- 
lated from the electronic equipment by a shielded 


1 Graduate research carried out at the Uni- 
versity of Missouri under the direction of Dr. 
Robert S. Daniel, to whom the writer is grate- 
fully indebted for suggesting the research and 
assisting in its design and execution. 


cage constructed of copper screen wire properly 
grounded. Shielded leads from the surface elec- 
trodes connected into one channel of a four- 
channel Rahm amplifier designed for bioelectric 
measures. The output of this amplifier was fed 
directly onto the vertical plates of a cathode 
ray oscilloscope, with the resulting signal photo- 
graphed by a 35-mm. moving film camera. 

Power was applied to the electronic apparatus 
at least 1 hr. before experimentation was to take 
place to insure proper “warm-up” of the equip- 
ment. At the beginning and end of each session, 
the apparatus was calibrated. This was accom- 
plished by feeding a 100-cps tone from an oscil- 
lator, through a microvolter, into the input of 
the amplifier. By systematically varying the 
input voltage and measuring the corresponding 
amplitude of the photographed signal, it was 
possible to graph the output amplitude as a func- 
tion of input microvoltage. 

Reaction time apparatus.—The reaction studied 
was that of a simple key-pressing response to a 
visual stimulus. A standard telegraph key was 
mounted on a table before S. A 1-in. diameter 
round window was cut in a cardboard screen and 
covered with translucent white paper. The 
screen was then mounted so that the stimulus 
window was at S’s eye level and at reading 
distance. 

A small slide projector located outside the 
shielded cage served as the light source. It was 
directed in such a manner that a small beam of 
light illuminated both the stimulus window and 
a small mirror mounted on S’s key. A magnetic 
shutter, normally closed, was used to block the 
beam of light between trials. By pressing a key, 
E could open the shutter and simultaneously 
illuminate the stimulus window and the record- 
ing mirror. This mirror reflected the beam of 
light to a second mirror, mounted on the face of 
the oscilloscope which in turn reflected the light 
into the camera where it was photographed as a 
solid black line. The slightest depression of S’s 
key deflected the beam of light so that it was no 
longer photographed. 

At the opposite side of the oscilloscope from 
the RT mirror was a second light source which 
was interrupted 100 times per second by a cir- 
cular shutter driven by a constant speed motor. 
On the photographic record this appeared as a 
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series of discrete dots, each representing a time 
interval of .01 sec. 

Procedure-—A major departure from the 
Meyer design was the use of the left arm instead 
of the right leg for purposes of obtaining action- 
potential records. Daniel (1) had shown earlier 
that the action potentials in the left arm are 
stronger than those in either leg when the right 
arm is active. Preliminary tests of the appar- 
atus and procedure in the present study con- 
firmed Daniel’s results and justified the change 
in procedure. It should be noted that while this 
change prevents a direct comparison of the 
results of the present study with those obtained 
by Meyer, it is not inconsistent with the hypoth- 
esis as originally stated by Dunlap. 

A second important departure from the pro- 
cedure used by Meyer was the adjustment of the 
amplifier gain control during each session to 
maintain as large an action-potential amplitude 
as could be accommodated by the 3-in. cathode 
ray tube of the oscilloscope. Had Meyer’s pro- 
cedure of maintaining the same amplifier gain 
throughout the experiment been followed, it is 
very probable that a number of responses would 
have been lost inasmuch as it was frequently 
necessary to increase the sensitivity of the ampli- 
fier during the course of each session to maintain 
a measurable action-potential signal. 

Upon entering the experimental room on the 
first practice day, S was seated before the RT 
key and given a set of printed instructions to 
read. These instructions were as nearly iden- 
tical to those used by Meyer as the changes in 
experimental design permitted. 

The active electrode was then placed over the 
thickest portion of the flexor carpi radialis and 
the neutral electrode over the thickest portion 
of the upper arm above the unused hand. After 
answering any questions on procedure asked by 
S, E left the shielded cage and proceeded to the 
AP apparatus, out of sight of S. 

At approximately 10- to 15-sec. intervals E 
would say “ready,” and, after an interval of 
from 2 to 4 sec., would press the key actuating 
the stimulus light shutter. The shutter then 
remained open until E either saw or heard the 
S’s reaction. 

Each of the 20 Ss were given six practice 
periods, one every two days. Each period con- 
sisted of 100 responses, with every tenth response 
recorded photographically. The average of the 
ten recorded responses was used as S’s score on 
that day. A list containing each S’s name and 
his individual mean RT score for each practice 
day was kept in the experimental room. As in 
Meyer’s study, each S was allowed to inspect 
this list before each practice session and compare 
his own score with that of the other Ss. 

The throwing of the clutch connecting the 
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Fic. 1. A typical record showing the time 
line, with each dot equal to .01 sec., the action 
potential with the measured area indicated, and 
the reaction-time line indicating the time inter- 
val from the onset of the stimulus to S’s reaction 


camera drive to a continuously running electric 
motor introduced a slight change in noise level 
and tone. However, introspective reports ob- 
tained from Ss at the end of the experiment indi- 
cated that none realized that only every tenth 
response was being recorded. It should be men- 
tioned that the camera drive motor and its 
associated reduction and drive gears provided a 
noise level more than adequate to mask the click 
of the magnetic shutter controlling the stimulus 
light, eliminating the possibility that Ss learned 
through practice to respond to the auditory 
stimulus of the shutter rather than to the stim- 
ulus light. 

Measurement.—The sample appearing in Fig. 
1 indicates that typical action-potential records 
were obtained. Inasmuch as the response in 
question appeared in the form of a group or 
cluster of spikes varying in number and temporal 
association, a measure of the spike of greatest 
amplitude was not considered to give a compre- 
hensive picture of the entire response. Conse- 
quently a procedure similar to that used by Davis 
(3, 4) was utilized. This consisted of summating 
microvoltages of the three largest spikes which 
occurred within a period of .1 sec. before the 
response and .2 sec. following the response. 
Each S’s action-potential score (AP) for each 
day was, then, the mean of these summated val- 
ues for the ten responses recorded. 

The RT score was obtained very simply by 
counting the number of .0l-sec. dots which 
occurred within the period from the onset of 
the stimulus until the reaction was made. 


RESULTS 


Table 1 contains the mean RT for 
all 20 Ss on each of six successive prac- 
tice days along with the mean AP in 
microvolts. Except for a small rever- 
sal on Day 4, it will be seen that the 
mean RT decreases steadily, while the 
mean AP drops consistently for the 
first four days, then rises on Days 5 
and 6. 
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TABLE 1 


Retationsnip BETWEEN Reaction Time, 
Action PoTenTIAL, AND AMOUNT 
OF PRACTICE 














RT in .01 Sec. AP in Microvoits 
Practice 
Day 
Mean SD Mean SD 

1 21.32 1.51 54.78 23.45 
2 18.61 1.91 53.16 26.11 
3 17.65 1.94 38.78 18.17 
+ 17.90 2.55 33.06 14.44 
5 17.08 1.91 40.64 19.66 
6 16.75 1.89 40.65 23.59 

















After utilizing Bartlett’s test for 
homogeneity of variance, a trend anal- 
ysis was run on the AP data using the 
analysis of variance technique. This 
is summarized in Table 2. The ob- 
tained F of 4.76 (for 5 and 95 df) is 
significant at the 1% level of confi- 
dence and indicates that the curve of 
the AP means, shown in Fig. 2, differs 
significantly from a horizontal straight 
line. 

Further analysis with the ¢ test indi- 
cates a significant difference between 
Day 1 and Day 4 (t = 3.05; 2.86 
needed at 1% level) and between 
Day 1 and Day 6 (t = 2.13; 2.09 
needed at 5% level), but no significant 
difference between Day 4 and Day 6 
(t = 1.46; 2.09 needed at 5% level). 
Hence the terminal rise in AP may be 
attributed to chance fluctuation and 
is not necessarily indicative of a real 
rise. 

A second analysis of the data was 
run on the tension level during the 











TABLE 2 
Trenp Anatysis or AP Data 
Source df oe F* 
Total 119 61645.05 
Br 5 7358.49 4.76 
Bs 19 24889.08 
IxS& 95 29397.48 














* F = 4.76 is significant at the 1% level of confidence. 


foreperiod. These data are, in gen- 
eral, consistent with those obtained on 
the AP itself and will not be reported 
here in detail. The same trends were 
present with the mean tension level 
decreasing for the first four practice 
days and rising again on the last two 
days. A trend analysis following 
Lindquist’s method (5) indicated that 
the two curves were essentially par- 
allel, with the curve for the AP data 
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Fic. 2. Relationship between mean reaction 
time and mean action potential on each of six 
successive practice days 


significantly higher than the curve for 
the foreperiod tension. 


Discussion 


The results of the present study are 
consistent with the hypothesis that 
tensions in a nonparticipating body 
member declines during learning or 
practice of a response by a restricted 
muscle group. The level of action 
potential is seen to decline steadily 
during early learning of a simple reac- 
tion-time response, then to rise as the 
practice is continued. Although the 
terminal rise in action-potential level 
was not statistically significant over 
the period covered by the present 
investigation, the trend obtained sug- 
gests that had practice been continued 
the rise might have attained statistical 
significance. 

A tentative hypothesis may be 
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offered to explain such a terminal rise 
in tension level. We may note that 
the rise in tension coincides with the 
asymptote of learning of the response 
in question. It seems likely that the 
Ss, under high motivation in a com- 
petitive situation, would tend to be- 
come more “anxious” or more “tense”’ 
when they find, late in practice, that 
their reaction-time scores are showing 
less and less improvement from session 
to session. Such an increase in gen- 
eralized tension would be expected to 
raise the absolute value for the action 
potential inasmuch as it is added to, 
or superimposed upon, the general 
tension level. 

This hypothesis would suggest that 
a more adequate test of the Dunlap 
hypothesis would consist in partialling 
out the tension present in S prior to 
the response or the onset of the pre- 
paratory set, and analyzing the re- 
maining action-potential values which 
may be considered more or less directly 
to be a function of the response itself. 


SUMMARY 


Action-potential measurements were 
taken from a nonparticipating body 
member of a group of 20 Ss responding 
in a simple reaction-time situation to a 
visual stimulus. The reaction time 
was found to decline steadily over a 


period of six successive practice days 
of 100 trials each. The action poten- 
tial during the foreperiod and at the 
moment of the response was found to 
decline during the first four days of 
practice, then to rise on Days 5 and 6. 
Although the terminal rise in action 
potential was not statistically signifi- 
cant, a tentative hypothesis was ad- 
vanced which explains the rise in terms 
of increased general tension arising 
from increased motivation to improve 
as the asymptote of learning is reached. 


(Received February 20, 1952) 
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DIFFERENT STRENGTHS OF SET IN THE 
SOLUTION OF ANAGRAMS! 


IRVING MALTZMAN AND LLOYD MORRISETT, JR. 
University of California at Los Angeles 


A major problem confronting the 
investigator of human problem solving 
has been to account for the “direction 
of thought,” the sequence of “‘ideas.”’ 
In a problem situation one relevant 
attempt at a solution occurs out of a 
great number of possibilities. It may 
be rejected if incorrect, retained if cor- 
rect. If incorrect, other relevant 
attempts may occur, while still others 
do not occur at all. Investigators in 
this area have tried to account for the 
direction of thought with the aid of 
such concepts as determining tend- 
ency, Aufgabe, set, direction, stress, 
and so on (5).? 

There have been numerous experi- 
mental demonstrations of the opera- 
tion of mental set in problem solving 
(2). One such study related to the 
present experiment is that of Rees and 
Israel (8). The problems used by 
these investigators were five-letter 
anagrams. In one experiment Ss 
were required to solve anagrams that 
had only one possible solution, either 
a word referring to some aspect of 
nature or a word referring to some 
aspect of eating. The instructions 
given did not in themselves lead to the 
arousal of a particular set for one or 
the other of these classes of anagrams. 
Following training on either the eating- 
set or nature-set anagrams, a test 
series was given in which the anagrams 


1 This study was supported by a grant from 
the Committee on Research, University of 
California. 

2 This paper will be concerned with the prob- 
lem of mental set as distinguished from direction. 
The distinction is similar to that made by Maier 
(6), although it would differ in the details of the 
related theory. 


had several possible solutions only one 
of which was a nature or an eating 
word, depending upon the previous 
training conditions. The higher per- 
centages of Ss with set training that 
gave eating or nature solutions on the 
test series as compared with Ss that 
did not have such training indicated 
the establishment of a set. 

From the point of view of S—R be- 
havior theory the problem of the 
direction of thought is similar to the 
problem confronting the learning the- 
orist attempting to account for the 
course of trial and error or selective 
learning. Hull has attempted to do 
this in terms of principles derived 
from conditioning experiments (3). 
The basic assumption of this paper is 
that these same principles can be 
employed in accounting for the direc- 
tion of human problem solving. 

Hull (4) has assumed that habit 
strength increases as a negatively 
accelerated function of the number of 
reinforcements, specifically, as an 
exponential function. It follows that 
with repeated trials in a choice situ- 
ation a difference in habit strength 
would develop more rapidly between 
two weak habits of equal strength 
than two strong habits of equal 
strength. 

Siegel (9) tested this implication by 
running two groups of rats in a T 
maze. One group received 40 rein- 
forcements in the left arm and 40 rein- 
forcements in the right arm. The 
second group received two reinforce- 
ments in the left and two reinforce- 
ments in the right arm. The Ss of 
both groups were then given 48 trials 
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in which reward was received in only 
one arm of the T maze, either the left 
or the right. It was found that the 
weak habit-strength group making 43 
mean correct responses learned signifi- 
cantly faster than the strong habit- 
strength group making 39 mean cor- 
rect responses. This is in accord with 
the implication of Hull’s theory. 

The design of the present experi- 
ment is similar to Siegel’s in its estab- 
lishment of weak and strong competing 
reaction potentials. The problem situ- 
ation was the simple anagram having 
only one correct solution (8). Com- 
peting response tendencies were the 
class of response solutions elicitable 
by the class of eating anagrams and 
the class of response solutions elicit- 
able by the class of nature anagrams. 

The hypothesis tested in the experi- 
ment to be reported is that if “mental 
set” refers to the acquisition of habit 
strength by a class of stimuli for the 
elicitation of a particular class of 
responses, the implication of Hull’s 
theory for different absolute initial 
habit strengths should also hold for 
incompatible anagram sets. There 
should be an inverse relationship be- 
tween amount of practice with incom- 
patible classes of anagrams and subse- 
quent success with one of them. - 


METHOD 


Subjects ——Forty Ss from introductory psy- 
chology classes were employed in the experiment, 
20 in each of the two major conditions. 

Stimulus materials —Five-letter anagrams 
typed in lower case on white 3 X 5 in. index 
cards were used as the problems. Most of the 
anagrams were obtained from lists published by 
Rees and Israel (8). Some of these had to be 
discarded following preliminary experimentation 
when alternative solutions belonging to different 
classes were discovered. Additional anagrams 
were constructed by the writers. The anagrams 
finally used in this study are presented in Table 1. 

Three different classes of anagrams were used, 
eating-set, nature-set, and neutral anagrams. 
Most of the eating- and nature-set anagrams had 
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TABLE 1 


ANnaGRAMS EMPLOYED IN THE EXPERIMENT 








Practice Nature-set Training 
cikbr _ brick lutpi _tulip 
ihrac chair rokbo brook 
saree erase sarts _ stars 

otsmr storm 
Neutral Training ae seeds 
ct tects esnfr _ ferns 
labls _ balls o— 1 
aetch cheat, teach = pet 
racig cigar — lw 
tegas gates, stage — ae 
ribet tribe alile lilac 


vreri river 
nirsa rains 


ypppo poppy 
charn ranch 


Inein linen 

iplsl pills, spill 
mesoc comes 
jonye enjoy 
sracf _scarf 


armch march, charm Eating-set Training 


Isalh _ shall, halls plepa apple 
arypt party ackse cakes 
oautb about skanc snack 
tirew write upenr rune 
wide dwell fiken nife 
admle medal, lamed teibs _ bites 


nuecl uncle — oe 
po te _ii pee 
aa poco 3 oabne bacon 
cet Get ny (at 
lichld child , 7 


- - vesno ovens 
anil andl acmer cream 
P P reybr__ berry 


Nature-set Test Eating-set Test 


sdyia_ daisy 


rus syru 

donsp ponds dugfe fetes 
ekcer creek bracs crabs 
mawps swamp elnuh lunch 
Sargs grass asad! __ salad 
nsswo snows turif fruit 

gofgy foggy rofsk forks 
ttye) jetty Tagep grape 
rmtho thorn, north yllej jelly 

eoanc ocean, canoe kinrd drink 





only one possible solution, a word referring to 
some aspect of nature or eating. In the few 
cases where alternative solutions were possible, 
the alternative belonged to the same class of 
responses. Neutral anagrams had one or two 
solutions neither of which was directly or obvi- 
ously related to eating or nature. 
Procedure.—The strong habit-strength group 
received 15 eating- and 15 nature-set anagrams 
in a prearranged order based on drawings from a 
table of random numbers. The weak habit- 
strength group received 26 neutral anagrams 
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followed by two eating- and two nature-set 
anagrams in an ABBA order. Before each S in 
the weak habit-strength group was run, the 
eating and nature anagrams were selected at 
random from those used with the strong habit- 
strength group. The two groups were thus 

\ equated for the total number of training trials. 
This equating procedure was necessary, because 
preliminary experiments indicated that amount 
of practice was a significant variable influencing 
the subsequent solution of a particular class of 
anagrams. A test series of ten anagrams was 
given immediately following the conclusion of 
the training series. Half of the Ss were tested 
with eating-set anagrams while half received 
nature-set anagrams. 

A 2-min. time limit was enforced for each 
anagram during the training and test series. If 
S did not solve the anagram within 2 min. or 
gave an incorrect solution, a correct solution was 
given by E. The experiment was conducted in 
a small room containing only a table and chairs 
for E and S. The E placed each anagram card 
on the table in front of S and removed it from 
view before the next anagram was presented. 
The S’s solution time was determined with the 
aid of a stop watch. 

The following instructions were given to Ss of 
both groups: “This is an experiment on thinking. 
You are to solve the five-letter anagrams that I 
will present to you, as quickly as possible. A 
five-letter anagram is simply five scrambled 
letters. Your solution must use all the letters, 
that is, it must be a real five-letter word. It 
must not be a proper noun or slang. As soon as 
you have the answer, call it out to me and I will 
give you another to solve, but be sure your word 
fits the five letters before you name it. If you 
can’t find a solution within 2 min. we will go on 
toanother. Hereisanexample: . . . Nowsolve 
this one . . . and now this one... . . Are there 
any questions? Do not ask any questions once 
the experiment has begun. Remember, call out 
your answer as soon as you find one.” 


RESULTS 


Table 2 shows the mean number of 
errors for the weak (W) and strong (S) 
habit-strength groups for Trials 1—5 
and 6-10. The significance of the 
difference between Trials 1-5 and 6-10 
in Cond. W was determined by the t 
test for related measures. A t of 2.66 
was obtained. For 19 df it is signifi- 
cant at the .02 level, indicating that 
some learning had occurred during the 
test series. A similar test for Cond. S 
was unnecessary, since it showed a 


TABLE 2 


Mean Errors oF W anv S Groups In TRIALS 
1-5 anp 6-10 or Test Series 

















Group W Group S 
Trials 
Mean SD Mean SD 
1-5 1.35 1.10 1.05 74 
6-10 50 67 1.15 1.20 

















slight increase in mean errors during 
Trials 6-10. The ¢ test of the differ- 
ence between the differences of the 
two groups was 2.38 (p = .02, for 38 
df). Anonparametric u test (7) indi- 
cated that the distributions of errors 


committed by the W and S groups on 


Trials 1-5 were not significantly dif- 
ferent. A u of 1.83 was obtained for 
Trials 6-10 which is significant at the 
.03 level of confidence (one-tailed test). 
No significant differences were ob- 
tained between nature-set anagrams 
and eating-set anagrams for the test 
series. 

Following the conclusion of the test 
trials E questioned each S with respect 
to his knowledge of the purpose of the 
experiment, whether he noticed any 
similarity or relationship among the 
anagrams, and his method of attack- 
ing the problems. None of the Ss 
reported that they were aware of the 
purpose or design of the experiment. 
The protocols indicated that 75% of 
the Ss tested with eating-set anagrams 
verbalized this similarity among the 
anagrams. Eighty per cent of the Ss 
tested with nature anagrams verbal- 
ized this relationship. Ten per cent 
of the Ss indicated some recognition of 
both eating and nature anagrams. 
The latter were all in the strong habit- 
strength group. Seventy-five per cent 
of the Ss recognizing some similarity 
among the anagrams were in the weak 
habit-strength group. Eighty per cent 
were in the strong habit-strength 
group. There were no significant dif- 














STRENGTH OF SET 


ferences in errors between Ss verbal- 
izing a similarity and those that did 
not. 

Forty-eight per cent of the Ss 
reported that they attempted to solve 
the anagrams by some systematic 
method of letter arrangement such as 
vowels around consonants or some 
letter order. Forty-eight per cent of 
the Ss reported that for some of the 
anagrams they tried a method of 
rearrangement of letters while for 
other anagrams the solutions just sud- 
denly appeared. The remaining Ss 
reported that the solutions “just 
came” tothem. These different meth- 
ods of attacking the problems were 
approximately equally distributed 
among the S and W subgroups. There 
was no significant difference in errors 
between the Ss reporting that they 
adopted one method or the other in 
attempting to solve the anagrams. 
Only one S reported that knowing the 
anagrams belonged to the same gen- 
eral class seemed to help him solve the 
anagrams. 

Discussion 


The results of the experiment re- 
ported support the implication con- 
cerning mental set drawn from Hull’s 
formulation of the principle of habit 
strength. The authors’ initial as- 
sumption was that if “mental set” 
designates behavior which can be ex- 
plained in terms of the acquisition of 
differential amounts of habit strength 
by a class of stimuli for the elicitation 
of a particular class of responses, then 
deductions from the principle of habit 
strength should hold for the phe- 
nomena described as mental set. The 
implication tested in the present study 
follows from the negatively accelerated 
growth function of habit strength. It 
is to the effect that a difference in 
strength would develop more rapidly 
between two habit strengths of equal 
but low absolute strength as compared 
with two habit strengths of equal but 
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high absolute strength. The absence 
of a significant difference in Trials 1-5 
indicates that several trials were neces- 
sary before the difference between 
competing reaction potentials in the 
W group became significantly greater 
than the difference between competing 
reaction potentials in the S group. 

As previously indicated, the prob- 
lem of directed thinking is similar in 
some respects to that of the selection 
of the correct response in a trial and 
error learning situation. Descriptive- 
ly, selective learning may occur when 
the ultimately correct response is not 
the dominant response in the initial 
response hierarchy of alternative re- 
sponses elicitable in a given stimulus 
situation. Learning is said to occur 
when the correct response becomes 
dominant in the response hierarchy by 
acquiring the greatest amount of effec- 
tive reaction potential. 

Descriptively, another kind of learn- 
ing, discrimination, is necessary when 
the correct cue or class of cues among 
a number of alternative stimuli is 
initially weaker than the alternative 
stimuli in its tendency to evoke a 
given response. Discrimination learn- 
ing occurs when the correct cues be- 
come dominant by acquiring the 
greatest amount of effective reaction 
potential for elicitation of the given 
response (11). 

A given nature anagram, for exam- 
ple, may serve initially as a stimulus 
for the arousal of nature words, nouns 
in general, verbs, particular letter 
orders, and so on. With repeated 
solutions of nature anagrams these 
other classes of responses are some- 
times reinforced but are also nonrein- 
forced or extinguished on other occa- 
sions. Only the class of nature re- 
sponses is consistently reinforced. It 
therefore becomes dominant, presum- 
ably through mediated generalization 
(1), in the hierarchy of responses elic- 
ited by nature anagrams. However, 
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only nature anagrams are consistently 
reinforced when correct nature re- 
sponses are elicited. No other stim- 
ulus complex is consistently reinforced 
following the elicitation of this class of 
response. Differential reinforcement 
of this class of anagrams will make it 
dominant in the hierarchy of stimuli 
eliciting nature responses. 

Establishment of a mental set of the 
type studied here involves a change in 
both the initial response and stimulus 
hierarchies. It involves both trial and 
error and discrimination learning. 
The behavior described as set is thus 
the result of the acquisition of a dif- 
ferential amount of reaction potential 
by a particular class of stimuli (dis- 
crimination learning) for the elicita- 
tion of a particular class of responses 
(selective learning). 

Thus all the variables determining 
reaction potential and the manner in 
which different reaction potentials 
interact may be determiners of the 
direction of thought. This implies 
that directed thinking is the conse- 
quence of associative factors, which 
may be called mental set, and non- 
associative factors, which may be 
called direction or determining tend- 
ency. This study was primarily con- 
cerned with the former. 

Finally, it may be noted that the 
acquisition of symbolic responses and 
their utilization in new stimulus situ- 
ations may come about as the result 
of discriminatory and selective learn- 
ing as outlined in the present paper. 
The principles employed in the ac- 
count of mental-set phenomena may 
also account for concept formation 
(10, 12). 


SUMMARY 


An experiment employing 40 Ss was 
conducted in order to test an impli- 
cation for the direction of human 
problem solving derived from Hull’s 
assumption that habit strength is a 


negatively accelerated growth func- 
tion. An inverse relationship between 
amount of training with incompatible 
anagrams and subsequent success in 
the solution of one of these classes of 
anagrams was predicted on the basis 
of this assumption. The experimental 
results were in accord with this pre- 
diction. A behavioral interpretation 
of the concept of mental set and 
directed thinking was outlined. The 
establishment of anagram sets was 
discussed in these terms. 


(Received February 23, 1952) 
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A phenomenon similar to reminis- 
cence in pursuit rotor learning has 
been reported by Irion and Gustafson 
(8) when Ss performed with one hand 
prior to the rest interval, but changed 
to the other hand following the rest. 
A similar result has also been obtained 
by Ammons and Ammons (3). Inthe 
Irion and Gustafson experiment, two 
groups of Ss practiced for ten 25-sec. 
trials with the right hand followed by 
ten trials with the left. All trials were 
separated by 5 sec., except that for 
one of the groups a 5-min. rest was 
introduced before the shift in hands. 
The postrest performance with the 
left hand was significantly better for 
the rest group. 

This finding is of considerable inter- 
est because it may cast some light on 
the nature and locus of reminiscence 
and other work and rest phenomena. 
For example, Hull’s theory of reactive 
inhibition, which has been extended to 
motor skill learning by Kimble and 
others (9, 10, 11), assumes that the 
amount of inhibition and hence the 


! This study was conducted in the Perceptual 
and Motor Skills Laboratory, Human Resources 
Research Center, Lackland Air Force Base, San 
Antonio, Texas. The present report was pre- 
pared primarily at the University of Illinois 
under the provisions of Air Force Contract 
33(038)-25726 sponsored by the Human Re- 
sources Research Center. The opinions expressed 
are those of the authors and not necessarily those 
of the Air Force. 


amount of recovery possible is a func- 
tion of the frequency with which the 
specific response involved has been 
made. This would seem to localize 
inhibition in the specific effector sys- 
tem, and the above finding would pre- 
sent an obvious difficulty. Of course, 
there is no assurance that this effect 
and ordinary reminiscence are basi- 
cally thesame phenomenon. One way 
of approaching this problem is to 
ascertain if ordinary and bilateral 
“reminiscence” are similar functions 
of the same variables. In the present 
experiment the two types of gain are 
studied as functions of the length of 
the rest interval. 

The functional relation between 
reminiscence and length of rest in ro- 
tary pursuit learning has been studied 
by Melton (14), Bell (4), Ammons (2), 
Kimble and Horenstein (11) and Irion 
(7). The finding has been that the 
gain is a negatively accelerated in- 
creasing function of the duration of 
the rest period up to a maximum of 5 
or 10 min. Kimble and Horenstein 
fitted a simple exponential function to 
their data. In the present paper, two 
parallel experiments are reported show- 
ing reminiscence and “bilateral rem- 
iniscence”’ as functions of time of rest 
from 10 sec. to 10 min. This is done 
in order that the two functions may 
be compared, and, incidentally, to 
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determine the law for ordinary remi- 
niscence with added precision. 


METHOD 


Subjects.—The Ss were 432 right-handed air- 
men basic trainees selected at random from those 
entering basic training at Lackland Air Force 
Base during a four-week period in August and 
September, 1951. ‘To determine handedness, all 
Ss who said they were right handed were screened 
by a simple test. They were handed a 3-ft. rod, 
and the hand with which they received it was 
recorded. They were then asked to show how 
they would use the rod as a bat, broom, rake, and 
shovel. Only those who were judged to perform 
all tasks in a right-handed manner were used as 
Ss. These men were then assigned at random 
to the experimental groups. 

Apparatus.—The apparatus consisted of 12 
units of the standard Air Force Rotary Pursuit 
Test described by Melton (15). The units were 
arranged in two rows of six with Ss facing each 
other, and all Ss were run in groupsof12. Trials 
and rest intervals were timed automatically by 
electronic timing devices and time on the target 
was recorded for each trial in units of .001 min. 
on Standard Electric timers. A buzzer auto- 
matically sounded 2 sec. before the beginning of 
each trial. 

Procedure.—Practice periods were 30 sec. in 
duration and were separated by 10-sec. rest peri- 
ods. The variable rest periods were introduced 
after Trial 15. In the straight reminiscence ex- 
periment, Ss employed the left hand before and 
after the rest (LL groups). In the bilateral trans- 
fer experiment, Ss practiced with the right hand 
before the rest and the left following the rest 
(RL groups). Separate groups of 36 Ss were 
run under each LL and RL condition. The rest 
intervals, which were the same in the two experi- 
ments, were 10 sec., 30 sec., 1 min., 3 min., 5 
min., and 10 min. All groups received at least 
five trials following the rest, but the 10-sec., 
5-min., and 10-min. groups received 15 trials in 
order that the postrest performance curves could 
be examined. 

Standard Air Force rotary pursuit instruc- 
tions were employed, with only the exceptions 
made necessary by the experimental conditions. 
At the end of the initial instructions, all Ss were 
told, “You may be asked to change hands during 
the test. If you are, just change the stylus to 
the other hand and continue as before. You will 
begin with your right (or left) hand.” Immedi- 
ately at the end of Trial 15, the 10-sec. RL group 
was told, “Change the stylus to your left hand.” 
The 10-sec. LL group continued without inter- 
ruption. At the end of Trial 15, all other groups 


were instructed, “Put down the stylus and rest.” 
Fifteen seconds before the time to resume prac- 
tice they were told, “Pick up the stylus.” Ten 
seconds before time to begin, the RL groups were 
told, “Change the stylus to your left hand.” 
Thus, all of the RL groups had an equal amount 
of time to make the change in hands. Three Es 
were always present. One gave the instructions 
and proctored, while the other two operated the 
equipment and recorded the scores for each trial. 


RESULTS 


If gain as a function of time was to 
be compared for the two situations in 
this study, it was necessary to devise 
some measure which would apply to 
both conditions. The usual method 
of indicating reminiscence by sub- 
tracting some measure of prerest per- 
formance from a comparable measure 
of postrest performance could not be 
applied. The RL groups’ postrest per- 
formance with the left hand was below 
that of the prerest right-hand per- 
formance, while the usual gains were 
shown by the LL groups. The desired 
comparisons actually were between 
gains over predicted postrest massed- 
practice performance. The measures 
presented are of this type. For each 


TABLE 1 


Mean VAtues ror Eacu EXPERIMENTAL 
Group 
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; ota ota ean 
Groups | Trials | Trials | Trials | Gain 
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ii 
10 sec. 545 675 675 0 
30 sec. 499 628 666 38 


1 min. 580 711 890 179 
3 min. 534 664 979 315 
5 min. 671 803 1211 408 
10 min. 628 759 1191 432 





RL 
10 sec. 866 373 373 0 
30 sec. 815 349 405 56 


1 min. 731 311 438 127 
3 min. 838 360 498 138 
5 min. 864 372 598 226 
10 min. 873 376 745 369 
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Fic. 1. Gain as a function of amount of rest. 
The filled circles represent the obtained values 
for the LL groups, and the unfilled circles repre- 
sent the values for the RL groups. 


of the two 10-sec. groups, correlation 
coefficients were computed between 
the total of the last five prerest trials 
and the total of the first five postrest 
trials. Linear regression equations 
were then obtained for each condition 
for predicting the postrest measure 
from the prerest measure. For the 
10-sec. LL group r was .86, and for the 
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regression equations were: 


LL: y = 1.015x + 122 
RL: y= .456x— 22 


The gain scores presented are totals 
of the five postrest trials minus the 
predicted total for massed practice 
(i.e., 10-sec. rest). This measure is 
not only appropriate to the present 
study, but is probably preferable to 
the usual methods of presenting remi- 
niscence data since it takes into consid- 
eration the correlation and variability. 

The basic data in terms of mean 
values for each group are presented in 
Table 1. Gains as functions of rest 
are also presented graphically in Fig. 
1. All data are in units of .001 min. 
on the target. The over-all trends for 
both conditions are highly significant 
as indicated by simple analyses of va- 
riance. Since the analyses were all 
computed from gains over predicted 
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Fic. 2, Learning curves for the 10-sec. groups and the 10-min. groups 
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of analysis of covariance. All values 
were made positive for the analyses by 
the addition of a constant. The F 
value for the LL experiment is 8.88 
and for the RL experiment it is 8.19. 
For the appropriate df these F’s are 
both significant at the .001 level of 
confidence. In general, the gains for 
the LL groups are greater than those 
for the corresponding RL groups. An 
analysis of variance was applied to the 
difference between the two conditions 
yielding an F based on the between- 
rows variance with the within-groups 
variance as the error term. The F 
value of 5.80 is significant at the .05 
level of confidence. 

Learning curves for the two 10-sec. 
and the two. 10-min. groups are pre- 
sented in Fig. 2. Here the gains for 
each condition are illustrated by the 
postrest differences between the corre- 
sponding massed-practice and rest 
groups. It will be noted that for both 
conditions the beneficial effect of rest 
appears to be a temporary phenome- 
non, disappearing as practice con- 
tinues. 


Discussion 


The results clearly indicate that bi- 
lateral “reminiscence” is an increasing 
function of the length of rest at least 
up to 10 min. In this respect, it is 
similar to the function for ordinary 
reminiscence. This fact still does not 
identify the phenomena, but at least 
suggests that they may be subject to 
similar interpretation. At least, the 
bilateral phenomenon requires inter- 
pretation in its own right. Other 
experiments will be necessary to study 
the possibility of similar functions for 
other variables. One thing accom- 
plished by the present finding is to rule 
out the possibility that Irion and 
Gustafson’s massed practice Ss did 
not have adequate time to change 
hands, and merely got off to a bad 


The 10 sec. 
allowed in the present experiment was 


start for that reason. 


constant for all groups. If such an 
artifact had produced the result, gain 
should not be a function of length of 
rest at all. 


Reminiscence as a function of time is such a 
regular function for the LL condition, that a 
curve-fitting attempt was justified. Hull (6) 
assumes that reactive inhibition is a simple ex- 
ponential function of time.* While this theory, 
strictly speaking, is only a theory of classical and 
instrumental conditioning, it has been extended 
to motor skill learning by Kimble and his asso- 
ciates. Kimble and Horenstein (11) were able 
to fit such an exponential function to their data. 
Such a function has also been fitted to the present 
data, and, as may be seen by the upper fitted 
curve in Fig. 1, an excellent fit was obtained. 
The equation is: 


Gain = 480 (1 — 10°27) — 36 


Interpreted in terms of the theory, the value of 
the asymptote (444) is determined by the total 
amount of inhibition accumulated during the pre- 
rest performance. The exponential decay func- 
tion is then subtracted from the asymptote. The 
function indicates that Ss recover from a con- 
stant fraction of the remaining inhibition per 
unit time. The fact that the G intercept is not 
zero but —36 may be explained by the fact that 
the rest period for the massed-practice group was 
10 sec. and not zero. If the gains had been 
computed from a zero-rest massed-practice 
group, the curve could have been made to pass 
through the origin. The closeness of the fit of 
this function certainly supports such an assump- 
tion concerning recovery from inhibition. In 
attempting to extend Hull’s theory into this 
area, however, one should remember that Ellson’s 
(5) spontaneous recovery data, from which the 
postulate was derived, do not approach an asymp- 
totic value until about an hour, while here, 
recovery is about complete in 5 or 10 min. 

The gains for the RL groups were not only 
smaller than for the LL condition, but the func- 
tion shows considerably less regularity. No 
simple negatively accelerated function will pro- 
vide a very good fit. One possible explanation is 
that greater irregularity lies in the nature of the 
gain measures employed. The lower correlation 
between prerest and postrest performance for the 
RL group, means that the gains are computed 


2 Although stated differently, this is the same 
assumption made by Ammons (1) regarding 
his comparable concept of “temporary work 
decrement.” 
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from less reliable predicted values. One plaus- 
ible hypothesis is that the vertical differences 
between the two functions are genuine, but that 
the true functions are otherwise the same. That 
is, they are approaching different asymptotes, 
but each approaches its own limit at the same 
rate. To test this hypothesis, a lower asymp- 
tote was estimated for the RL function according 
to the method proposed by Lewis (12). This 
value was then substituted in the equation and 
the lower curve of Fig. 1 resulted. The new 
asymptote being 310, the equation became: 


Gain = 346 (1 — 10°27) — 36 


A test of goodness of fit was then applied to test 
the hypothesis that the deviations of the ob- 
tained values from this function could be due to 
sampling error. The test, Case 7 suggested by 
Lindquist (13), involves a variance estimate 
based on “departure from hypothesis” and the 
within-groups variance as the error term. The 
test yielded an F of 1.60, which for 5 and 210 
df is insignificant, and indicates that statistically 
the hypothesis is tenable. 

Since the reactive inhibition theory assumes 
that inhibition results from the repetition of a 
specific response, it would at first appear that it 
cannot account for the gains obtained with the 
change in hands. However, it should be remem- 
bered that pursuit rotor performance is a very 
complex task, involving not only movements of 
the hand, arm, and shoulder muscles, but move- 
ments and postural adjustments of the entire 
body, as well as complex perceptual adjustments. 
Proponents of the theory could contend that 
recovery from inhibition localized throughout 
the body might well account for the gains. The 
difference between the asymptotic values of the 
two curves could be taken to represent the 
amount of inhibition localized in the initial hand 
andarm. A more simple and direct explanation 
of the difference between the two curves, how- 
ever, could be simply that enough bilateral trans- 
fer did not occur to produce gains of the same 
magnitude, even though all inhibitory effects 
were dissipated. Reasoning of the above kind 
can be carried on with assurance only when 
something is known concerning the absolute level 
of learning. The present finding does at least 
indicate that there are temporary inhibitory fac- 
tors which may involve gross bodily effects, 
perceptual factors, or perhaps central phenom- 
ena. Furthermore, these inhibitory effects dis- 
sipate as a function of time, perhaps in the same 
manner as the inhibitory effects which account 
for the conventional reminiscence phenomenon. 
Further experiments and theoretical analysis will 
be required to reveal the nature and locus of 
these inhibitory phenomena. 


A further similarity between the two kinds of 
gains may be seen in the learning curves of the 
10-sec. and 10-min. groups in Fig. 2. Here it 
may be seen that the superiority of both rest 
groups over their corresponding massed-practice 
groups diminishes, when practice is continued 
under the massed condition, and is entirely gone 
in from 12 to 15 trials. The difference between 
the two LL groups on Trial 30 is no larger than 
the initial sampling error between the groups. 
The curves for the 5-min. groups, which are not 
presented, show a similar picture with the excep- 
tion of the fact that the gains are smaller. This 
may be taken to indicate that the inhibitory 
factors in both conditions reach a maximum 
value in a relatively short time. This is in 
accord with Irion’s finding that reminiscence as a 
function of amount of prerest practice reaches a 
maximum value after about 10 to 20 25-sec. 
trials. 


SUMMARY 


1. Two parallel experiments were 
conducted in which rest periods of 10 
sec., 30 sec., and 1, 3, 5, and 10 min. 
were introduced after 15 30-sec. trials 
of rotary pursuit performance. In 
one experiment Ss practiced with the 
left hand before and after the rest 
(LL condition), while in the other 
experiment they employed the right 
hand before the rest and the left hand 
afterwards (RL condition). 

2. Gain over predicted massed-prac- 
tice scores was an increasing function 
of amount of rest for both conditions. 
The LL groups showed larger gains. 

3. For the LL conditions gain was a 
simple exponential function of length 
of rest up to about 10min. The com- 
parable function for the RL conditions 
showed less regularity, but the hypoth- 
esis is tenable that it is the same func- 
tion approaching a lower asymptotic 
value. 

4. In both conditions the superiority 
of the rest groups over the massed- 
practice groups disappeared with con- 
tinued practice. 

5. It is suggested that the gains in 
bilateral transfer indicate the presence 
of temporary inhibitory factors either 
associated with generalized postural 
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adjustment or of a perceptual or cen- 
tral character. 


(Received February 25, 1952) 
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That stereoscopic perception of 
depth may exist even when the dis- 
parities between the images of the two 
eyes are large enough that those images 
are perceived double had been known 
before Helmholtz’s time (4, v. 3, p. 
430-431;5, p. 221). Yet much of the 
subsequent literature is not clear on 
this point. Hillebrand (6, p. 205) 
wrote that stereoscopic perception of 
depth occurs only so long as the dis- 
parate images are fused. Matsuda 
(9) presented data to show that under 
low illumination the limits of stere- 
opsis agreed with the limits of fusion. 
Yet those familiar with the Hering 
“drop test” know that the falling bead 
can unequivocally be seen behind or in 
front of the fixation point, even when 
the images of the falling bead are seen 
double. Tschermak (12) took the 
position that fusion of the disparate 
images represents the most favorable 
condition for stereoscopic depth local- 
ization, but not the absolute condition. 

Experiments by Tschermak and 
Hoefer (13), by Aall (1), and by 
Burian (2), to select a few examples, 
tended to show that even relative 
stereoscopic localization could be quite 
precise, even though the test objects 
were seen in double images. Helmholtz 
and others have pointed out, however, 
that if the disparity is too great, stere- 
opsis will not exist. 

Estimates of the limiting disparity 
were made by French (3), who used 


1 Read before the Division of Experimental 
Psychology, at the meeting of the American 
Psychological Association, Chicago, Illinois, 
Sept. 4, 1951. 
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two pairs of vertical lines, the separa- 
tion of the lines in each pair being 
different. Without using a stereo- 
scope, but observing these lines in 
“free” stereoscopy, he found that, 
though two of the lines were perceived 
double, he had stereoscopic depth per- 
ception except when the ratio of the 
separations exceeded 1.6, for then 
depth perception ceased. Previously, 
Prandtl (11) in studying the Panum 
phenomenon, in which a stereoscopic 
depth is perceived when one eye sees a 
single vertical line and the other eye 
sees a pair of vertical lines, reported 
that when the separation of the two 
lines exceeded 1° of arc, the phenome- 
non no longer occurred. Eye move- 
ments were, of course, permitted in 
both these experiments. A test of 
stereoscopic vision was devised by 
Koch (7) during the last war, based on 
measuring the limiting disparity for 
stereopsis. He used simple red-green 
anaglyphs with targets which gave the 
familiar pyramid observed toward the 
apex end. Disparities of as much as 
4° were occasionally reported. Scan- 
ning eye movements were also per- 
mitted in this test. The specifications 
of the targets he used were not given, 
so that the basic limiting capacity of 
the eyes for stereopsis cannot be com- 
puted. Such a pyramid is not a simple 
enough configuration with which to 
make such calculations. No syste- 
matic investigation for the limiting 
values of stereoscopic depth percep- 
tion appears to have been made. Yet 
the existence of such limits is an im- 








254 KENNETH N. OGLE 


portant fact in our understanding of 
the binocular visual processes. 

Experimental technics used to study 
stereopsis usually cannot avoid some 
empirical factors to depth perception. 
As the test object is moved behind or 
in front of the plane of fixation, its 
angular size changes, and its bright- 
ness and color change. Moreover, its 
images in the two eyes blur, which not 
only gives a distance clue but also acts 
as a stimulus for a change in accom- 
modation. The ranges of disparities 
available in these apparatuses were 
also restricted by mechanical limita- 
tions. With the stereoscope unnat- 
ural conditions exist, including per- 
haps such inhibiting factors as texture 
of paper on which the figures are 
drawn. 


INSTRUMENTATION 


The apparatus used in the experiment to be 
reported here eliminated all these empirical fac- 
tors, and permitted a measurement of the range 
of disparities within which stereoscopic depth 
could be perceived in the visual plane, for central 
and peripheral vision. 


Fixation point Fr 





Right eye 


Left eye 


Fic. 1. Relationships between the spatial 
positions of points in space, the corresponding 
visual angles, and the geometrical disparity 
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Fic. 2. Schematic sketch of the apparatus 
used to measure the limits of stereoscopic vision 


Figure 1 shows the geometrical relationships 
between the disparity 6 and the visual angles a 
and a: which subtend the distance F—P to the 
left and right eyes respectively. The disparity 
& = a2 — a as measured here is the geometrical 
disparity, and as such is zero only when the point 
P lies at Po on the Vieth-Miiller circle. The 
physiological disparity will differ slightly from 6, 
but the error in neglecting the difference even 
for a peripheral angle of 10° is only of the order 
of 15’ of arc (10). In view of the magnitudes of 
disparity measured with double images, this error 
will be neglected and the geometrical disparity 
used for the physiological. 

The experimental apparatus used is illustrated 
schematically by the perspective sketch in Fig. 2. 
A fixation point consisting of a small illuminated 
spot, .25 mm. in diameter, is attached to a large 
sheet of thin plate glass, 50 cm. from the eyes. 
By reflection from special mirrors (50% trans- 
mission—50% reflection) at about an angle of 
45° the eyes also see the images of a test object. 
The test object used for the data presented here 
was a thin, highly polished needle (about 3 cm. 
in length) suitably illuminated by a small pro- 
jection lamp. The two mirrors can be made to 
rotate in opposite directions about axes 4-4, by 
means of levers and a screw arrangement. A 
dial indicator shows the degree of turning of the 
two mirrors. A rotation of the mirrors intro- 
duces a disparity between the images of a point 
seen by the two eyes symmetrically about its 
position when lying on the Vieth-Miller circle 
(when the angle of disparity is zero). By turn- 
ing the arm that supports the test object the 
disparity measurements can be made for a cen- 
tral and any peripheral visual angle. This instru- 
ment makes possible a large range of disparities 
of the images of the test object, and within this 
range the images do not change brightness or 
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STEREOSCOPIC VISION 


angular size or become blurred. With a suitable 
headrest and chin cup, head movements are prac- 
tically avoided. Horizontal slit apertures before 
the eyes and suitable distant backgrounds restrict 
the binocular visual field to the fixation point 
and the images of the test object. The experi- 
ment is conducted in a semidarkened room. 
Hence empirical clues to spatial localization are 
eliminated. 

For a given experiment the observation of 
stereoscopic depth between the test object and 
the point of fixation corresponding to the angle 
of disparity could be made in three ways: (a) By 
means of a motor control, the mirrors could be 
rotated continuously and thus the disparity in- 
creased in one direction continuously. Or the 
change in disparity could also be introduced 
manually by a suitable knob (see Fig. 2), when 
the motor control is disengaged from the appar- 
atus. (b) The images of the test object could be 
flashed (exposed) momentarily, by a tap key or 
telechron timing switch inserted in the circuit for 
illuminating the test object. (c) The image of 
the test object seen by one eye (its half-image) 
could be continuously seen while the half-image 
seen by the second eye could be exposed at the 
will of S, by means of a suitable sliding aperture 
mounted between the test object and the mirrors. 

With steady fixation, the stereoscopic be- 
havior of the images of the test object could 
thus be observed for given changes in the dis- 
parity, continuous exposure, momentary expo- 
sure and exposure of one of the half-images. 
Muscle imbalance between S’s eyes must be 
small in order that accurate fixation on so small 
a detail can be held. The principal difficulty is 
to hold fixation steady, while observing double 
images and stereoscopic depth. Some prelim- 
inary training is usually necessary. 


RESULTS 


Subjective experience of stereoscopic 
depth.—Using the several procedures 
outlined in the preceding paragraphs, 
the experience of stereoscopic depth, 
observed as the disparity is increased 
in one direction, seems to fall into 
three ranges of disparity. 

The test object is first seen at the 
same distance as is the fixation point. 
Then while an increasing uncrossed 
disparity is introduced, the image 
appears to move back, and S is aware 
of a strong stereoscopic sense of depth 
difference between fixation point and 
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object—a sense of plasticity. The 
test object appears to move back pro- 
portionally to the increase in disparity. 
Then the images suddenly appear as 
two. At this point the disparity be- 
tween the images has reached the outer 
boundary of the region of binocular 
single vision, and the disparity at this 
point corresponds to the horizontal 
extent of Panum’s fusional area (10). 

The stereoscopic sense of depth con- 
tinues to increase, however, as the 
disparity increases, though the image 
of the test object is seen double. The 
separation of these images, of course, 
continually increases also. Then rather 
suddenly the strong sense of depth 
fades, and the depth interval no longer 
increases. Yet the two half-images 
still appear farther away than the fixa- 
tion point. This disparity represents 
the limit of a patent stereoscopic 
experience. 

With a continued increase in dis- 
parity, the separation of the double 
images also increases, and while no 
real sense of stereopsis exists, still the 
two images unmistakenly appear farther 
than the fixation point. Then rather 
abruptly even this impression ceases, 
and each of the two half-images ap- 
pears indefinitely localized, and their 
separation merely increases. It may 
be questioned whether within this last 
range of disparities there is a true 
stereoscopic sense; yet there certainly 
is still a cerebral interaction between 
the images as seen by the two eyes, 
which gives rise to an impression of 
depth. 

If the disparity is increased in the 
opposite direction, the stereoscopic 
perception of depth is one of nearer 
than the fixation point, but the same 
regions can be differentiated. 

Measurement of the limits.—For the 
most part, the measurements of the 
disparities corresponding to the limits 
discussed in the preceding paragraphs 
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were obtained by the method of adjust- 
ment. Inseveral instances the method 
of constant stimuli was also used, pri- 
marily to learn that such a procedure 
could be applied to the determination 
of these limiting disparities. There 
can be no doubt that these limiting 
disparities are not sharply defined, and 
under given conditions there would be 
a probability function of experiencing 
or not experiencing depth perception 
for a particular range of disparities. 
To a certain extent the limiting dis- 
parities vary with the conditions of 
the experiment—general illumination, 
size of test object, fatigue of S—and 
probably somewhat with training. 
Failure to keep exact fixation is a 
small source of error, in that this 
would effectively increase the range of 
stereoscopic depth perception between 
limiting disparities. The amount of 
fusion detail in the fixation object as 
used here is less than that of the test 
object. Hence the disparity between 
the images in the test object seen by 
the two eyes would provide a strong 
stimulus for a fusional movement, and 
this would tend to weaken the hold of 
fixation on the dot. Momentary ex- 
posure of one or both of the half- 
images tends to eliminate this problem. 


Two Ss served as observers. One of the two 
was a trained observer of stereoscopic phenom- 
ena; the other was entirely lacking in such 
experience. The data were obtained in three 
ways as regards the exposing of the test object, 
namely: 

1. The stereoscopic depth experience was ob- 
served for a continuous and regular change in 
disparity introduced by the electric motor driv- 
ing arrangement. This movement was abruptly 
stopped the instant a given limit was judged to 
have been reached. A braking arrangement 
assures positive stopping of the movement. 

2. The images of the test object were momen- 
tarily exposed by a tap key, following adjustment 
of the disparity either by motor control or by 
manual operation. The duration of the exposure 
was about a third of a second. 

3. With one of the half-images continuously 
seen, the second half-image was momentarily 


exposed by means of a manual adjustment of the 
sliding aperture, again as the disparity was 
changed manually or by motor control. Some- 
times a suppression of the unoccluded image 
occurred when the occluded second half-image 
was exposed. This of course interferes with the 
judgment of stereoscopic depth, and this phe- 
nomenon seems to occur more often when the 
disparity is near the limits of a patent stereo- 
scopic sense of depth. For disparities not too 
large, it was a striking phenomenon to observe 
that the continuously seen half-image would be 
indefinitely localized, or localized at the will of S, 
yet when the second half-image was exposed, 
both would appear to jump back (or forward) in 
space stereoscopically under the stimulus of the 
disparity. 


The data for the limiting disparities 
corresponding to the range of patent 
stereoscopic depth and those corre- 
sponding to the additional range 
where each of the double images is 
perceived qualitatively in depth are 
illustrated graphically in Fig. 3, for 
central vision and for peripheral 
angles out to about 7° of arc. The 
points plotted on the graph represent 
the average of the “far” (uncrossed) 
and the “near” (crossed) limiting dis- 
parities. This set of data was ob- 
tained by the untrained S. The mag- 
nitudes of the limiting disparities for 
the trained S were on the whole some- 
what larger than for the untrained S. 
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STEREOSCOPIC VISION 


The disparities corresponding to the 
limits of fusion (and therefore the hori- 
zontal dimension of Panum’s areas) 
are also shown, these being represented 
by the set of data lowest on the graph. 
The limiting disparities which define 
the range within which there is a 
patent stereoscopic perception of depth 
are shown by the two sets of data next 
higher on the graph, the lower set 
being the data obtained by momen- 
tarily exposing one of the half-images, 
the upper set being those obtained by 
momentary exposure of both half- 
images simultaneously. The set of 
data highest on the graph is for those 
limiting disparities which define the 
range where a qualitative depth of 
both of the double images is appreci- 
ated. These last limiting disparities 
varied less with the method of expos- 
ing the half-images. These limits 
indicate the largest disparities within 
which an interaction occurs between 
the neurological and cerebral processes 
of the two eyes. 

The limiting disparities for patent 
stereopsis and those for a qualitative 
depth for double images differ some- 
what with the time of exposure of the 
disparate images. Certainly with con- 
tinued observation of the double im- 
ages, the stereoscopic perception may 
fade and the images may be indefi- 
nitely localized—a fact mentioned in 
the literature. It is believed that the 
data shown here are quite represen- 
tative of the magnitude of the limiting 
disparities. 

A study of these data shows that 
even at the maculae of the two eyes 
there is a range of disparities outside 
of which there is no patent stereo- 
scopic perception of depth, a range 
which is only slightly greater than the 
size of Panum’s areas for the same 
region of the retinas. The measure 
ments here of Panum’s fusional area 
are of the same magnitude as those 


257 


values previously found (10). A quali- 
tative appearance of depth of the two 
double images increases the range by 
about 5’ of arc. Since the experi- 
mental arrangement of the apparatus 
causes these disparities to increase 
symmetrically with respect to the fix- 
ation point, the cortical representation 
of the two half-images would be, 
according to presently accepted the- 
ories, on the opposite lobes of the 
occipital cortex. 

The range of disparities for patent 
stereopsis increases quite rapidly with 
the peripheral angle, and the range for 
a qualitative appreciation of depth of 
the double images increases even more 
rapidly. 

The range of disparities within which 
there is a patent sense of stereoscopic 
depth is larger when the half-images 
are exposed simultaneously than when 
one of the half-images is seen con- 
tinuously and the other is exposed 
momentarily. In the case of the simul- 
taneous exposure of the two images, 
a subjective identification of the two 
images as belonging to the same exter- 
nal object might be the cause of this 
difference, or it might be a “retinal 
saturation” (8) from the continuously 
seen half-image which inhibits the 
binocular interaction with the second 
half-image when it is exposed. The 
suppression of the continuously seen 
half-image occurs much more fre- 
quently in the exposure of one of the 
half-images. To prevent after-images 
and retinal fatigue, S was asked to 
look away from the fixation point 
between observations. 

The spatial regions corresponding to 
the ranges of disparities of Fig. 3 can 
be obtained by computation.* These 


Very approximately the range of depth 
(Ab) between the fixation point and the boundary 
given by a limiting disparity (5) can be calculated 
from the relationship Ab = 575/24 in which 3 is 
the observation distance and 2a is the inter- 
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Fic. 4. The spatial regions corresponding to 
the several limits of stereoscopic depth percep- 
tion calculated from the data of the trained S 


regions calculated from the data for 
the trained S are shown in Fig. 4. 
An average of the disparities for the 
limiting values for the range of patent 
stereopsis was used. The region cor- 
responding to binocular single vision 
(Panum’s areas), that to patent stereo- 
scopic depth and finally that to a 
qualitative depth perception with 
double images can be easily identified. 
Within the wedge-shaped (uncross- 
hatched) regions symmetrical with the 
median plane, in front and behind the 
fixation point, no stereoscopic depth 
perception would exist. 

If one assumes that these limiting 
disparities are constant for all obser- 
vation distances, then as the distance 
of the fixation point from S increases, 
the size of these spatial areas will also 
increase greatly—nearly with the 
square of the observation distance. If 
the eyes are fixed on a very distant 
object, the range of patent stereoscopic 


pupillary separation. Specifically in the experi- 
ment here Ab = .12 6 cm., when 4 is expressed in 
minutes of arc. 


depth perception at a peripheral angle 
of about 6° will be from infinity to a 
distance of about 10 ft. from S. The 
effective range of patent stereoscopic 
vision will be increased if eye move- 
ments are permitted and an interme- 
diate fixation point is selected. 

The data presented here do not 
imply that there is a precise stereo- 
scopic spatial localization from double 
images (though certain experiments 
cited in the literature would indicate 
this), but they do show that there is a 
stereoscopic perception of depth over 
a wider region than previously thought 
possible. Within one’s ordinary sur- 
roundings, of course, the empirical fac- 
tors of distance localization play a 
marked role independent of stereo- 
scopic depth localization. 

If stereopsis were entirely a psycho- 
logical phenomenon, no limiting ranges 
of disparities as found would be antici- 
pated. The fact that these finite 
ranges exist certainly is evidence for 
the existence of neuro-anatomical lim- 
itations, and probably a physiological 
basis for the stereoscopic perception of 
depth. The extent to which the neural 
paths that originate in the retinas are 
multiplied and, more pertinently, the 
extent to which these fibers overlap in 
the terminal regions of the occipital 
cortex (or some other area in the brain) 
would provide a basis for the neuro- 
anatomical limitations. Because at 
the macular regions of the two eyes a 
limiting disparity for stereopsis of the 
order of 10 to 20’ of arc is found, the 
neuro-anatomical overlapping prob- 
ably cannot be, at least wholly, in the 
occipital areas of the cortex, because 
the half-images would be represented 
on opposite lobes of the occipital 
cortex. 

Any generalized theory of space per- 
ception based on binocular vision must 
take into account the existence of 
these limiting disparities and their 
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corresponding region in space, outside 
of which there is no stereoscopic depth 
perception. 


SUMMARY 


The fact that stereoscopic percep- 
tion of depth is possible from dis- 
parate images which are seen double 
has been known since the middle of the 
last century. No systematic investi- 
gation of the limiting disparities within 
which such stereopsis exists, however, 
has been reported. In this paper an 
apparatus is described which permits 
such limiting disparities to be found 
without its introducing at the same 
time any of the usual empirical factors 
to spatial localization. Data on two 
Ss show that there are two ranges of 
depth with double images, the one in 
which there is a strong patent experi- 
ence of stereoscopic depth, the second 
one in which there is a qualitative 
sense that both double images are 
seen farther or nearer than the fixation 
point, depending on the direction of the 
disparity. Measurements are made 
by the method of adjustment for the 
limiting disparities corresponding to 
these ranges. For a patent stereopsis 
these disparities range from about 10 
minutes of arc at the maculas to 70 
minutes of arc at a peripheral angle of 
6 arc degrees. The qualitative sense 
of depth of the two double images 
ceased at disparities of about 15 min- 
utes of arc at the maculas to nearly 
2 arc degrees at a peripheral angle of 
6 arc degrees. The extent of these 
regions has never been appreciated 
before. The existence of these limit- 
ing disparities is taken as evidence of 
neuro-anatomical limitations to be 
accounted for on the basis of the mul- 
tiplication and overlapping of neural 
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fibers in the terminal region of the 
occipital cortex or some other area of 
the brain. 

These limiting disparities and the 
corresponding region in space outside 
of which no stereoscopic vision exists 
must be taken into account in any 
general theory of space perception 
based on binocular vision. 


(Received March 4, 1952) 
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INFORMATION THEORY AND IMMEDIATE RECALL 


MURRAY ABORN AND HERBERT RUBENSTEIN 
Michigan State College 


As ordinarily used, the word infor- 
mation involves the idea of a message 
adding something to the addressee’s 
knowledge. In communication the- 
ory, the concept of information is more 
restricted: only when the addressee 
learns something from a particular 
segment of message which could not 
be learned from the context in which 
the segment occurred, is the segment 
said to contain information. The 
term information will be used only in 
this restricted sense throughout this 
study (2; 3, p. 74-112). 


The relation between context and information 
may be clarified by the following example. Sup- 
pose we heard the utterance “The door was 
hermetically ——.” There would be no diffi- 
culty in getting the import of the utterance. 
We would automatically supply the word 
“sealed” since the occurrence of this word could 
be predicted with certainty from the portion of 
the utterance which we heard, i.e., from its con- 
text. “Sealed” is the only word which can occur 
in the context “The door was hermetically: “hag 
Since the utterance “The door was hermeti- 
cally ——” functions just as well as “The door 
was hermetically sealed,” we can say that 
“sealed” adds no information in this context. 

Suppose we heard the fragment “The letter 
was .”’ We could not be certain what the 
next word was. It could be “sealed,” but it 
could also be “written,” “mailed,” “here,” “on,” 
etc. Since the occurrence of “sealed” here can- 
not be predicted with certainty from the context, 
it is clear that its presence tells us something we 
would not know if it were absent. Its presence, 
therefore, conveys information. 








The information contained in a seg- 
ment, then, is related to the probabil- 
ity of predicting its occurrence from 
what has gonebefore. The probability 
of predicting the occurrence of a cer- 


1 Only the portion of the message preceding 
the segment in question is considered relevant in 
computing the amount of information contained 
in the segment. 


tain word in a given context is, in 
turn, partially dependent on the num- 
ber of words which may occur in that 
particular context. Mathematically 
stated, the amount of information con- 
tained in any given segment is equal to 
the logarithm to the base 2 of the number 
of alternatives which can occur in the 
given context. Thus, given a segment 
occurring in a context in which 15 
other segments can replace the given 
segment (so that counting the seg- 
ment which does occur, there are 16 
alternatives for that particular con- 
text), we would say that the informa- 
tion value of the given segment in that 
context is log, 16 or 4 bits. The bit 
(a telescoping of binary digit) is the 
unit of information. 
This is not the whole story. Given 
the context “I saw a very interesting 
,’ the statement would contain 
more information if the blank were 
filled by “antimacassar” than if it 
were filled by “play” since the occur- 
rence of “antimacassar” is far less 
probable in this context than “play” 
is (and consequently far less predict- 
able). If the alternatives in a given 
context differ in probability of occur- 
rence, these differences must be taken 
into account in computing the amount 
of information that each of them con- 
tains. Such differences are always 
found in social languages. Since no 
data are available on probability of 
occurrence for any language, a lin- 
guistic information study at present 
can be carried out accurately only in 
acultural sequences such as nonsense 
material. The mathematical formula 
given above, which assumes the equal 
probability of all alternatives, is fully 
applicable to such material. 
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To sum up the concept of informa- 
tion, we can say: (a) The amount of 
information conveyed by a particular 
segment can be calculated only in 
reference to a particular context. (5) 
It is a logarithmic function of the 
number of alternatives which may 
occur in that context. The greater 
the number of alternatives, the greater 
the amount of information conveyed 
by the given segment. Thus, ran- 
domness, i.e., freedom of occurrence 
of the segments composing our uni- 
verse, results in high information 
values. Information and organiza- 
tion, i.e., limitation on the occurrence 
of the segments composing our uni- 
verse, stand in an inverse relationship. 
(c) Differences in probability of occur- 
rence, where they exist, must be taken 
into account in the calculation of the 
amount of information. Differences 
of this sort are found in all systems 
of social communication. Information 
studies in meaningful language aiming 
beyond rough approximation must 
await further data on these prob- 
abilities. 


STATEMENT OF THE PROBLEM 


The present study seeks to apply 
the concept of information to the 
problem of learning as measured by 
immediate recall. Specifically, infor- 
mation theory was used to furnish 
rules under which test passages of 
varying degrees of contextual organi- 
zation were constructed. By the use 
of these rules, the information con- 
veyed by the passage could be speci- 
fied exactly. 

A different, though related, gauge 
of organization was used by Miller and 
Selfridge (1), who graded the organi- 
zation of their test passages according 
to the approximation to English. 
While this method enabled them to 
deal with meaningful material, degree 
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of approximation to English, especially 
as worked out in their study, is not 
nearly so accurate a measure of con- 
textual limitation as is provided by 
information theory. 

The organization of our material is 
achieved by progressively limiting the 
context in which the various nonsense 
syllables can appear. The limitations 
applied in each passage constitute the 
organization pattern of the passage. 
The degree of organization is measured 
by the average rate of information per 
syllable. The average rate of informa- 
tion is computed by dividing the total 
amount of information contained in the 
given passage by the number of syllables 
in the passage. If the organization 
patterns are utilized by the Ss, they 
should be able to recall a greater num- 
ber of syllables in the more highly 
organized passages. Since, however, 
the rate of information per syllable is 
in inverse proportion to the degree of 
organization, the information score 
should be the same for all passages. 
More formally stated then, the hy- 
potheses of this study are: (a) The 
amount of learning in terms of syl- 
lables recalled is greater as the organi- 
zation of the passage is greater, i.e., as 
the average rate of information is 
smaller. (6) The amount of learning 
in terms of the information score, 
computed as the product of the num- 
ber of syllables recalled and the aver- 
age rate of information, is constant for 
all passages. 


METHOD 
Structure of the Nonsense Syllables 


A vocabulary of 16 readily pronounceable 
nonsense syllables was constructed. These syl- 
lables were: 


Class 1 Class 2 Class 3 Class 4 
ZAP YAV KER EMP 
ZAS YAD KEZ ERT 
ZOP YOM KIF IPS 
ZOT YOG KIB INF 


In order to permit the setting up of simple 
rules of distribution, the nonsense syllables were 
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divided into four classes, each containing four 
syllables. To facilitate their learning, the syl- 
lables were constructed so that the membership 
of each class was characterized by some formal 
feature: Class 1 syllables all began with Z, Class 2 
syllables with Y, Class 3 with K, and Class 4 
with Eorl. The other formal features may be 
seen by inspection. 


Structure of the Passages 


Six passages were constructed, all about of 
the same length, 30-32 syllables.? All syllables 
were employed with approximately the same 
frequency in the passages so that the differences 
in probability of occurrence played no role in the 
calculation of the amount of information. The 
passages differed from each other in the degree of 
organization, i.e., more limitations were imposed 
on the distribution of the syllables as one went 
from Passage 1 to Passage 6. 

In Passage 1 the syllables were presented ran- 
domly. This meant that at any point in the 
passage, S had to choose his syllable from 16 
alternatives. The rate of information per syl- 
lable was therefore 4 bits. 

In Passage 2 a low degree of organization was 
introduced: The syllables were divided by com- 
mas into groups of two. Within each group, the 
second member belonged to the next higher class 
than the first. Thus, if the first syllable of the 
group belonged to Class 1, the second belonged 
to Class 2. If the first belonged to Class 2, the 
second belonged to Class 3, and so on. The 
progression was in the form of a circle so that 
when the first syllable belonged to Class 4, the 
second syllable belonged to Class 1. In selecting 
the first syllable of any set, S again chose from 
16 alternatives (4 bits). In deciding on the 
second number of the set, however, he had to 
choose from only four alternatives (2 bits) since 
he could infer the class to which this syllable 
belonged from his knowledge of the class of the 
first syllable. The average rate of information 
in this passage was, therefore, 3 bits per syllable. 

The syllables were again marked off into 
groups of two in Passage 3. There were only 
two kinds of groups. One consisted of a Class 1 
syllable followed by a Class 2 syllable while the 
other consisted of a Class 3 syllable followed by a 
Class 4 syllable. The kinds of groups occurred 
in random order. The number of alternatives, 
then, from which S chose the first member was 
eight (3 bits), i.e., he chose from the membership 
of two classes, 1 and 3. The number of alter- 
natives involved in his choice of the second mem- 


2 Passages 1, 2, and 3 each contained 30 syl- 
lables while 4, 5, and 6 each contained an addi- 
tional two syllables in order that the passages 
might not end with an incompleted pattern. 


ber was only four (2 bits) since the class of this 
member is inferable from the class of the first 
member. Thus, the average rate of information 
was 2.5 bits per syllable. 

In Passages 4, 5, and 6, the groups were four 
syllables long. Each group of four syllables was 
marked off by commas. Within each group, the 
first syllable belonged to Class 1, the second to 
Class 2,and soon. This was the total organiza- 
tion in Passage 4. At any point in Passage 4, 
then, since it was possible to infer the class of the 
sought-after syllable from the class of the pre- 
ceding syllable, S chose from four alternatives. 
The rate of information in this passage, then, was 
2 bits per syllable. 

The additional restriction in Passage 5 was 
that adjoining syllables should not contain the 
same vowel-letter. This restricted the choice in 
Classes 2 and 4 since these classes contained 
syllables with the same vowel-letters as 1 and 3 
respectively. The choice in Classes 1 and 3, 
however, remained unrestricted since none of the 
syllables in these classes contain the same vowel- 
letters as the classes they followed (Classes 4 and 
2 respectively). Thus, S was faced with four 
alternatives (2 bits) in selecting a syllable which 
he knew to belong to Class 1 or to Class 3. In 
selecting a syllable which he knew to belong to 
Class 2 or 4, on the other hand, he had to choose 
from only two alternatives (1 bit) since two 
alternatives were ruled out as containing the 
same vowel-letter as the preceding syllable. The 
average rate of information in this passage was 
1.5 bits per syllable. 

Passage 6 had the same organization as Pas- 
sage 4 plus the following restrictions: The vowel- 
letters in the odd-numbered groups were AAEE, 
e.g., ZAP YAV KEZ EMP, while the vowel- 
letters in the even numbered groups were OOII, 
e.g., ZOP YOG KIF INF. Selection at any 
point, consequently, was limited to two alterna- 
tives. The rate of information in this passage 
was | bit per syllable. 


Correctness of Recall 


Two criteria had to be satisfied for a syllable 
to be considered correctly recalled: (a2) unam- 
biguous reproduction of the form of the syllable, 
and (bd) accurate reproduction of the position of 
the syllable. 

Since the information calculations were only 
on the syllable level and not on the letter level, 
ie., there was no concern with, say, the number 
of alternatives from which the S selected P after 
the ZA, there was an attempt to facilitate the 
correct reproduction of the syllable forms as 
much as possible. Thus, each S was required to 
memorize the inventory of syllables before the 
study was started. Further, this inventory was 
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printed on the top of each answer sheet for S to 
refer to at will. Errors in reproduction of form 
were, consequently, very few. If the reproduc- 
tion was unambiguous, though inaccurate, e.g., 
AMP for EMP, credit was given provided, of 
course, the second criterion of correctness was 
also satisfied. 

Position could be acceptably reproduced in 
either of two ways: (a) By reproducing the syl- 
lable in the same numerical position as it was in 
the passage. Numerical position, which can be 
considered an aspect of context, was completely 
adequate for the calculation of the information 
conveyed by the given syllable. Accordingly, if 
S reproduced ZAP correctly as the third syllable 
of the passage, it was considered correct even if 
the first and second syllables were incorrectly 
reproduced. (b) In a sequence of syllables which 
correctly reproduced the relative order of the 
passage syllables, the position of all syllables 
other than the first was considered adequately 
represented even though this position was in- 
correct numerically. The first syllable of such 
sequences had to be considered incorrect since it 
was improperly selected from the viewpoint of 
its preceding environment. Since some control 
over the order of the group as a whole was con- 
sidered necessary, the degree of deviation from 
the numerical position was limited to a 4-syl- 
lable disparity. (In the last three passages, this 
was tantamount to saying that the reproduced 
sequence could be no more than one group away 
from where it was in the passage.) Thus, if 
syllables occupying positions 13, 14, 15, 16 were 
recalled as occupying positions 21, 22, 23, 24, the 
reproduction would be considered totally incor- 
rect. If, however, they were recalled as occupy- 
ing positions 17, 18, 19, 20, the syllables in posi- 
tions 18, 19, 20 would be considered correctly 
reproduced. 


Procedure 


Group I.—The Ss were all undergraduate stu- 
dents attending a course in psychology. As a 
group, they were quite homogeneous with respect 
to age, level of intelligence, and educational back- 
ground. There were about twice as many males 
as females. The Ss were allowed about three 
days in which to memorize the nonsense syllables 
and the characteristics of the classes into which 
they were divided. At the end of that time, they 
were given a criterion test consisting of items 
ranging from simple class identifications to the 
construction of syllable sequences based upon 
complex class combinations. Only 26 of the 
initially selected 40 Ss were subsequently judged 
sufficiently proficient in the manipulation of the 
syllables to participate in the experiment. These 
26 Ss compose Group I. 
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In order to minimize the possibility of nega- 
tive transfer between the organization patterns, 
Group I was administered one passage per class 
meeting, i.e., three to four days apart. To nullify 
the effect of practice, the order of the passages 
was randomized. 

Each S was given a sheet containing the 
organization pattern of the passage on which the 
Ss were to be tested and allowed a 10-min. study 
period to fix them in his mind. At the end of the 
study period, the first sheet was removed from 
sight; each S received a sheet containing the 
actual passage to be memorized, and allowed 
3 min. to learn as much of it as he could. When 
3 min. had elapsed, the passage was removed 
from sight and Ss recorded as much of the pas- 
sage as they could recall. This procedure was 
followed for all six passages. 

Group II.—The data obtained with Group I 
suggested certain modifications in design which 
might serve to reduce variability. The change 
consisted of having Ss take all passages in one 
sitting. ‘To minimize fatigue, only four of the 
passages were employed (1, 2, 4, and 5). In 
other respects, the experimental procedure was 
identical to that followed with Group I. 

In a new group of 20 students who were given 
the syllables to study, only seven were found 
sufficiently proficient to participate in the experi- 
ment. These seven Ss compose Group II. 


RESULTs AND Discussion 


The data obtained for the perform- 
ance of Group I are given as part of 
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syllables correct are presented for each 
passage. They represent the raw 
scores from which the information 
scores were computed. As hypothe- 
sized, the means for number of syl- 
lables correct show a_ progressive 
increase in magnitude from Passage 1 
(no organization) through Passage 6 
(high organization), with the excep- 
tion of Passage 5. Since Ss took the 
passages in random order, the increase 
could not be the effect of practice. 

Were the increases proportionate to 
the average rate of information con- 
tained in each passage, the mean infor- 
mation scores would remain constant 
from Passages 1 through 6. As indi- 
cated in Fig. 1, the mean information 
scores fluctuate from passage to pas- 
sage; the variation among means for 
Passages 1 through 4 being relatively 
slight (within a range of 3 score units), 
and that for Passages 5 and 6 being 
relatively large. In the case of Pas- 
sage 5, the mean number of syllables 
correctly recalled shows no increase 
over the mean number recalled in Pas- 
sage 4 and the mean information score 
drops considerably. In the case of 
Passage 6, the mean information score 
shows a further decrease despite the 
large increase in the mean number of 
syllables correctly recalled. An infor- 
mation score commensurate with that 
achieved for Passage 1, for example, 
would require an average recall of 26 
syllables compared with the 16.5 
accomplished. 

According to the hypotheses of this 
investigation, the organizational struc- 
ture of the six passages would enable 
Ss to memorize the syllables in greater 
quantity as the average rate of infor- 
mation becomes less. Moreover, the 
increment in the number of syllables 
learned for any given passage would 
result in an information score for that 
passage equal to the information score 
achieved for the previous passages. 


TABLE 1 


ANALYsIs OF VARIANCE OF INFORMATION 
Scores or Group I For 
Passaces 1, 2, 3, anp 4 
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Variation af Square F 
Passages 3 53.33 _ 
Subjects 25 113.28 1.34 
Interaction 75 84.48 














The data in Fig. 1 testify that this did 
not occur. The means for Passages 5 
and 6 seem particularly divergent, and 
the use of independent ¢ tests for 
related measures showed that both are 
significantly different from Passages 1, 
2, 3, and 4. In the case of the first 
four passages, however, the differences 
among means seem within the range 
of chance fluctuation. The notion 
that the data for these passages are 
statistically homogeneous is, in fact, 
a tenable one. When the information 
scores for Group I are subjected to an 
analysis of variance, the results (Table 
1) indicate that neither passages nor 
subjects contribute significant vari- 
ance when tested against the inter- 
action variance. 

The available evidence shows, then, 
that the amount of information con- 
tained in the material recalled was 
constant in the less organized passages. 
In the more highly organized passages, 
where the average rate of information 
fell below a critical value, the amount 
of information contained in the mate- 
rial recalled was reduced. 

This is, in essence, the same finding 
as revealed by Miller and Selfridge (1), 
who observed that raising the orders 
of approximation affected the per- 
centage of recalled material only up to 
a certain point. For example, in the 
ten-word passage the percentage re- 
called does not increase appreciably 
above the second order of approxima- 
tion while in the 50-word passage the 
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percentage recalled does not increase 
appreciably above the fifth order of 
approximation. 

The results of Group II are given as 
part of Fig. 1. The curves for Group 
II reflect a higher level of performance 
but are otherwise similar to those for 
Group I in most respects. There is 
the same “leveling off” of the curve 
for syllables correct after Passage 4 
and the same consequent drop in the 
information curve. The curve for 
syllables correct shows a more con- 
sistent acceleration from Passages 1 
through 4, however, and the informa- 
tion curve is correspondingly straighter 
(the mean information scores fall 
within a range of $ of 1 score unit). 

The results of an analysis of vari- 
ance (Table 2) bear out the impression 
gained by an inspection of Fig. 1. 
The mean information scores for 
Group II show no significant variation 
for Passages 1 through 4. On the 
other hand, much less information was 
contained in the material recalled from 
Passage 5. The difference between 
Passage 5 and the less organized pas- 
sages was found to be even more sig- 
nificant with this group than it was 
with Group I. Individual differences 
among the Ss reach a barely signifi- 
cant level in this group. This appears 
to be a consequence of a somewhat 
greater range in individual ability in 
the group of Ss chosen. 


TABLE 2 


ANALYsIs OF VARIANCE OF INFORMATION 
Scores or Group II ror 
PassaGEs 1, 2, anp 4 











Sor f M 

Verietion af P omen F 
Passages 2 50 — 
Subjects 6 237.00 4.01* 
Interaction 12 59.17 








* Significant at the .05 level. 
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In general, the results with Group II 
confirm the findings with Group I 
rather well. The altered procedure 
was moderately successful in reducing 
variability and the same results emerge 
in spite of the small number of Ss. 


SUMMARY AND CONCLUSIONS 


The present paper reports an at- 
tempt to investigate the influence of 
the degree of organization upon the 
ability of Ss to recall lists of syllables 
immediately after learning. A group 
of Ss was made familiar with a small 
inventory of nonsense syllables and 
later learned various rules of organi- 
zation, each set of rules progressively 
limiting the syllables which could 
appear. They then tried to memorize 
passages of between 30-32 syllables 
chosen in conformity with the rules 
learned previously. Organizational 
arrangements employed ranged from 
random order (high average rate of 
information) to high organization (low 
average rate of information). 

1. More syllables are correctly re- 
called from a passage with a lower 
average rate of information than from 
a passage with a higher average rate 
of information. 

2. Up to a certain degree of organi- 
zation, between 2 and 1.5 bits per 
syllable, the amount of information 
learned by the Ss was constant. 

3. Beyond a certain degree of organi- 
zation, about 1.5 bits per syllable, the 
relationship between the quantity of 
material recalled by Ss and the aver- 
age rate of information contained in 
that material tended to become dis- 
proportionate. Although an increas- 
ingly greater number of syllables was 
correctly recalled, the total amount of 
information contained in the syllables 
was smaller than in the less organized 
passages. 

4. Up to a certain degree of organi- 
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zation, the quantity of material which 
S can recall can be predicted from a 
knowledge of the average rate of infor- 
mation of the given passage and S’s 
level of ability as measured in a pas- 
sage having a different average rate of 
information. 


(Received March 6, 1952) 
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INSTRUCTIONAL SETS AND HABIT INTERFERENCE 


MONCRIEFF H. SMITH, JR. 


University of Washington ' 


Our everyday behavior is composed 
in large part of simple, well-defined 
responses made to any one of a large 
class of stimuli. The stimuli may 
remain the same from day to day, but 
learning varies membership in the 
class of stimuli to which the response 
is made. From a practical point of 
view, therefore, it is of considerable 
importance to discover what happens 
to such behavior when the rules are 
changed, i.e., when the same response 
must be made to a new class of stimuli 
which may or may not overlap the old 
class. In this situation the external 
stimuli remain unchanged, and the 
only alteration is in the nonspecific 
stimuli introduced by verbal instruc- 
tions or by differential reinforcement. 

Most theoretical analyses of trans- 
fer or of discrimination reversal con- 
cern themselves with similarity rela- 
tions between original and transfer 
test stimuli (1, 2, 5, 6) and do not 
really bear on the question asked here. 
However, insofar as they do cover the 
present situation they suggest that 
habit interference can be predicted 
from specific factors alone (degree of 
practice, intraserial stimulus similar- 
ity, etc.). Possible interactions be- 
tween instructional sets and previous 
practice are rarely considered. The 
experiment reported here was designed 
to explore such interactions. 


1 This experiment was done at the Perceptual 
and Motor Skills Research Laboratory, USAF 
Air Training Command Human Resources Re- 
search Center, Lackland Air Force Base, San 
Antonio, Texas. The opinions or conclusions 
contained in this report are those of the author. 
They are not to be construed as reflecting the 
views or indorsement of the Department of the 
Air Force. 


APPARATUS AND PROCEDURE 


The stimuli used were the 26 letters of the 
alphabet, printed in block capitals and photo- 
graphed on 35-mm. film, one letter to a frame. 
The negatives were then spliced to form a con- 
tinuous film strip 260 frames long, and random 
in sequence except for the restriction that each 
letter appeared just ten times. Eight strips 
were made, varying only in sequence of letters. 
The letters were flashed on a beaded screen from 
a projection window behind the Ss. An East- 
man 35-mm. film strip projector was modified to 
present the letters automatically at the rate of 
one frame per second. Changing frames required 
.2 sec., so that each letter was visible for .8 sec. 
The screen was 11 ft. from the Ss and the image 
of the letters (white on black) was 20 in. high on 
the screen. 

The task was to respond by pushing a button 
whenever any one of eight predesignated letters 
appeared on the screen. The Ss were tested in 
groups of four. They were seated at a long table 
which was divided by partitions into four com- 
partments so that they were unable to see one 
another. The partitions did not prevent them 
from hearing one another, but interaction was 
minimized by assigning each of the four Ss a 
different set of letters and by instructing them 
that this was the case. Each S had a response 
box before him, and recorded his responses 
directly on an Esterline Angus operations re- 
corder. The Ss were all airmen undergoing basic 
training at Lackland Air Force Base. 

Nine groups of four Ss each were given eight 
trials—four in the morning and four in the after- 
noon. Approximately 5-min. rest was given 
between trials within a set of four. A trial was 
one sequence of 260 letters. After a short prac- 
tice series in which S responded once to each of 
four letters, each S was given a slip of paper with 
eight letters, and told to respond whenever any 
one of the letters appeared. Each S responded 
to the same set of eight letters for the first five 
trials. 

On Trial 6 each S was given a new set of 
letters. For one S in each group the letters were 
the same that he had been responding to all 
along, but were arranged in a different order on 
the paper. The nine Ss so tested will be referred 
to hereafter as the “same” group. 

One other S in each group was given a new 
set of eight letters, none of which was the same 
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as his original set. These nine Ss will be called 
the “different” group. The other two Ss of 
each group were given new sets consisting of four 
letters from their original set of eight, and four 
new letters. The order of the old letters on the 
new slip of paper was scrambled. These 18 Ss 
will be called the “half” group. 

For Trials 7 and 8 all Ss responded to the 
same letters they were given on Trial 6. Sche- 
matically, the design looked like this: 


Group n First 5 Trials Last 3 Trials 

Same 9 ABCDEFGH GCFABHED 
Half 188 ABCDEFGH IBGJKELF 
Different 9 ABCDEFGH IJKLMNOP 


The actual letters assigned varied from man to 
man. In all cases the selection of letters and 
their order (within the framework of the design) 
was determined by a table of random numbers. 
The slips of paper containing the original 
letters were given to the Ss about 1 min. before 
the beginning of the first trial. They could study 
the letters then, and also during the 5-min. rest 
between trials. The slips were available during 
trials, but the 1-sec. rate of presentation left little 
time for studying the letters. On the first trans- 
fer trial, Ss again had about 1 min. to study the 
new letters before the trial began, with possible 
study during and between subsequent trials. 


RESULTS 


The design allowed a comparison of 
errors from one condition to another, 
and between errors made to different 
types of stimuli within an experimental 


condition. In order to make such 
comparisons meaningful it was neces- 
sary to reduce all scores to a common 
unit. The measure used was the num- 
ber of errors per S divided by the 
possible number of such errors. Table 
1 gives the results, expressed both as 
mean number of errors per S and as 
percentages of the possible number of 
errors. The mean of all errors is also 
presented for each group. This is 
simply the sum of the means of the 
different types of errors. 

On the first five trials for all three 
conditions there were 18 letters unas- 
signed, each occurring ten times, allow- 
ing 180 opportunities per trial per S 
for errors of commission (responses to 
letters not currently assigned). The 
9.3% errors of commission for the 
“half” group on Trial 1 is 16.8 divided 
by 180. For errors of omission, there 
were only 80 opportunities per S per 
trial. Thus the 14.6% error of the 
“half” group is 11.7 divided by 80. 

On the last three trials the oppor- 
tunities for error differed from one 
group to another. For the “same” 
group the percentages were calculated 
in the same fashion for all eight trials, 


TABLE 1 


Tria, By Tria AveraGeE Errors PER SusjecT AND Per Cent Error (NuMBER OF 
Errors Divinep sy Opportunity oF OccuRRENCE) 


























Errors of Commission by Group 
“*Same” “ Different” “Half” 
Trial 
Not Asgd. Not Asgd. om ie Not Asgd. Sage, Me ht 
Mean % Mean % Mean % Mean % Mean % 

1 14.7 8.2 12.9 7.2 16.8 9.3 
2 7.1 3.9 7.4 4.1 10.6 5.9 
3 . 1.8 2.8 1.6 6.5 3.6 
4 1.8 1.0 ee 1.3 4.6 2.6 
5 1.8 1.0 2.4 1.3 4.4 2.4 
6 1.3 0.7 9.4 9.4 5.4] 6.8 4.8 3.4 10.4 26.0 
7 1.1 0.6 3.8 3.8 5.2 6.5 2.1 1.5 6.3 15.8 
8 0.8 0.4 1.9 1.9 2.8 3.5 1.9 1.4 4.8 12.0 
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INSTRUCTIONAL SETS AND HABIT INTERFERENCE 


since these Ss were assigned the same 
letters in a different arrangement. 

For the “different” group, eight 
letters were assigned for the first five 
trials, and eight different letters for 
the last three, leaving only ten letters 
on which erroneous responses to unas- 
signed letters could be made. Thus, 
9.4% is 9.4 over 100. There were 80 
opportunities per S per trial for mak- 
ing errors by responding to letters that 
had been assigned for the first five 
trials but were not assigned for the 
last three trials. There were also 80 
opportunities per trial for making 
errors of omission. 

The “half” group had only four new 
letters assigned on the last three trials, 
so that there were 14 unassigned letters 
that could be responded to. The 3.4% 
of such errors on Trial 6 is 4.8 divided 
by 140. For each of the other three 
categories of error there were 40 oppor- 
tunities per trial. 

Table 1 shows that the “same” 
group was influenced little, if any, by 
the change in arrangement of letters 
assigned. Errors of omission show a 
slight rise, but this is more than offset 
by a drop in the errors of commission, 
so that the average of all errors shows 
an almost unbroken fall. 
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The “half” and the “different” 
groups both showed a pronounced in- 
crease in errors from Trial 5 to Trial 6. 
In the case of the “different” group, 
the average of all errors on Trial 6 
was identical to that on Trial 1—27.9 
errors on both trials. For the “half” 
group, there were slightly fewer errors 
on Trial 6 than on Trial 1 (26.6 and 
28.6, respectively), but the difference 
is far from significant (t = .28). From 
this analysis it would appear that the 
“different” and the “half” groups had 
responded in a very similar manner to 
the conditions imposed on them. 

However, an analysis of the relative 
frequency of the different types of 
errors shows that this was not the 
case, and that the two sets of instruc- 
tions not only had different effects in 
this experiment, but might be expected 
to produce different results in other 
situations. 

The “different” group showed no 
evidence of negative transfer. The 
average of all errors they made on 
Trial 6 was the same as that on Trial 1. 
The breakdown into errors of commis- 
sion and omission did not correspond 
quite so neatly, but in neither case did 
the difference between Trials 1 and 6 
approach significance. It might be 


TABLE 1 (Continued) 
































Errors of Omission by Group Total Errors 
“Same” “* Different” “Half” 
“Same” | “ Diff” | “Half” 
Asgd. Tr. 1-8 | Asgd. Tr. 1-5 | Asgd. Tr. 6-8 | Asgd. Tr. 1-8 | Asgd. Tr. 6-8 
Mean % Mean % Mean % Mean % Mean | % Mean Mean Mean 
16.9 | 21.1 | 15.0 | 18.8 11.7 | 14.6 31.6 27.9 28.6 
3.7 4.6 2.8 3.5 3.8 4.8 10.8 10.2 14.4 
1.8 2.2 1.2 1.5 2.3 2.9 5.0 4.0 8.8 
1.6 2.0 0.6 0.8 1.2 1.5 3.4 2.9 5.8 
0.6 0.8 0.4 0.5 1.0 1.2 2.4 2.9 5.4 
0.8 1.0 13.0 | 16.2 5.4 | 13.5 5.9 14.8 2.1 27.9 26.6 
0.4 0.5 7.2 9.0 1.2 3.0 | 2.7 6.8 1.5 16.2 12.4 
0.4 0.5 3.6 4.5 0.9 2.2 1.3 3.2 1.2 8.2 8.9 
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argued that this is not a valid com- 
parison, and that five-trials practice 
on a similar task should have pro- 
duced a gain in performance. The 
task is so simple that this is doubtful, 
and pilot studies involving responses 
to different sets of letters on each trial 
showed no practice effect. 

Moreover, the internal evidence 
shows no sign of interference in this 
group. Fewer errors were made in 
response to letters that had previously 
been assigned than were made to 
letters never assigned (6.8% error on 
previously assigned letters, and 9.4% 
on letters never assigned). This dif- 
ference between percentages is not 
significant (t = .61), but the direction 
of the difference indicates no evidence 
for the carry-over of old responses. 

For the “half” group there was evi- 
dence of negative transfer. It does 
not appear in terms of an increase in 
the total number of errors from Trial 1 
to Trial 6. There was actually a de- 
crease in total errors from 28.6 per S 
on Trial 1 to 26.6 on Trial 6. This 
decrease is more than accounted for 
by the reduction in responses to letters 
that were never assigned. There was 
an average of 9.3% of such errors on 
Trial 1 and 3.4% on Trial 6. This dif- 
ference is highly significant (t = 4.53). 
Errors of omission were not signifi- 
cantly different on Trials 1 and 6. 

The negative transfer appears in the 
carry-over of previously practiced 
responses. There was an average of 
10.4 such responses per S—26% of the 
opportunities for occurrence. This 
contrasts markedly with the 6.8% of 
such errors in the “different” group 
(t = 3.62 for percentages). Moreover, 
the direction and significance of the 
difference are not dependent on the 
correction factor applied to equate 
opportunity of occurrence. The un- 
corrected average number of carry- 
over errors was greater in the “half” 


group than in the “different” group 
(10.4 and 5.4 respectively, t = 2.98). 

Internal evidence for habit inter- 
ference in the “half” group is also 
strong. The group averaged over 
twice as many carry-over errors as 
errors made to unassigned letters (10.4 
and 4.8 respectively, t = 3.52). In 
terms of percentage of possible errors 
this difference is even greater. 

The most obvious interpretation of 
these carry-over errors would suggest 
that some Ss misread or forgot the 
new letter assignments, and consis- 
tently responded to one or two of the 
old letters not now assigned. The 
data do not support such an interpre- 
tation. If we rank the previously 
assigned letters for each man of the 
“half” group in the order of erroneous 
responses to them, then a high fre- 
quency of error on the first letter or 
two might indicate such a tendency. 
Actually, the average errors fall off 
gradually and regularly—the means 
being 4.71, 3.16, 1.78, and .72 (10 
possible). Seven of the 18 Ss made 
one or more errors on all four of the 
letters. Eight more Ss made errors 
on three of the four letters, and the 
three Ss who confined their errors to 
two or fewer letters were the three 
with the lowest totals of such errors 
(0, 3, and 4). 

The high frequency of carry-over 
errors in the “half” group might be 
expected to be accompanied by posi- 
tive transfer in the case of previously 
practiced letters still assigned. Table 
1 shows very little evidence for this. 
The tendency to omit newly assigned 
letters was only slightly greater than 
the tendency to omit letters that had 
been assigned on all trials. There 
were an equal number of each type of 
letter, and 13.5% error on the old 
letters as compared with 14.8% on the 
new ones. The difference is not signi- 
ficant (¢ = .40), and neither percent- 














INSTRUCTIONAL SETS AND HABIT INTERFERENCE 


age of omission differs significantly 
from the omissions in the “‘different’’ 
group. Inspection of Table 1 shows 
that this difference was larger on 
Trials 7 and 8, and suggests that in 
this case the consideration of only 
Trial 6 might be misleading. Adding 
together the omissions on the last 
three trials gives a more reliable score, 
as well as a possible measure of more 
persistent transfer effects. The dif- 
ference for all three trials is somewhat 
more in favor of positive transfer (a 
mean difference of 2.4 more omissions 
of responses to new letters, t = 1.45), 
but the difference is not significant. 


Discussion 


The difference between the “dif- 
ferent” and the “half” groups with 
respect to negative transfer, or carry- 
over of the old response, poses no par- 
ticular problem for theories of trans- 
fer. The momentary, external stim- 
ulus situation was the same for both 
groups, but the sets given by instruc- 
tions to the two groups were quite 
different. A stimulus-response inter- 
pretation must assume that the in- 
structions set up some sort of stimulus 
complex that persisted throughout 
learning, and that the transfer instruc- 
tions set up a new complex. For the 
“half” group these two stimulus com- 
plexes were more similar than they 
were for the “different” group. Stim- 
ulus-response transfer theories have 
not explicitly covered this situation, 
but it would seem that they could 
easily be extended to do so. 

A Gestalt theory of response pro- 
duction advanced by Lewin (3, 4) also 
predicts these results. Lewin denied 
the importance of such concepts as 
associative strength, and assumed in- 
stead that the intention, or determin- 
ing tendency, of S gave rise to the 
response. He predicted that negative 
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transfer occurs only when two of these 
tendencies are in conflict. This was 
the case in the “half” group. Part of 
the time the Ss were correct in doing 
what they had previously learned to 
do; hence the tendency to perform in 
the manner previously practiced 
comes in conflict with the tendency to 
follow instructions. No such conflict 
occurred in the “‘different”’ condition, 
since the old responses were never 
correct. 

Both types of theory agree in pre- 
dicting the difference between the 
groups with respect to negative trans- 
fer, and they should agree again in 
predicting positive transfer in the 
“thalf”’ group on the previously prac- 
ticed responses. The results, how- 
ever, give no support to this prediction. 
Previously practiced responses were 
correct only slightly (and insignifi- 
cantly) more often than responses to 
newly assigned letters. Even if further 
experimentation showed this differ- 
ence to be significant, some explana- 
tion would still be necessary to account 
for its small size relative to the nega- 
tive transfer. 

Although a theory could undoubt- 
edly be developed to include the results 
reported here, it would have to include 
considerably more than a considera- 
tion of associations between external 
stimuli and response. It would have 
to account for the absence of overt 
habit interference in the “different” 
group as well as for the presence of 
interference in the “half” group. This 
calls for a much more explicit theory 
of the effects of instructions, and prob- 
ably for the assumption of some sort 
of mediating process. Since the ten- 
dency to respond to the old letters 
should have been stronger than it was 
to the newly assigned ones, something 
must have intervened. This process 
could have been the identification of 
the set of old letters. They were well 
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practiced and were, therefore, easily 
recognizable. But under the instruc- 
tions given, none of these easily recog- 
nized letters called for a response. On 
the other hand, the letters that had 
never been assigned could not be as 
easily differentiated from the newly 
assigned letters, and confusion between 
them would be expected. 

If this sort of assumption is valid, 
then a transfer test earlier in learning 
might give evidence of overt negative 
transfer in the “different” group. This 
prediction is an insecure one, since it 
depends on a faster initial build-up 
of the specific response tendency than 
of the recognition process. On the 
other hand, if the importance of the 
identification is increased, it should be 
possible to get significantly more errors 
of commission to letters never assigned 
than to previously practiced letters. 
This might be the case with a larger 
class of stimuli than letters, and an 
increased number of elements to which 
the subject must respond. 


SUMMARY 


Subjects were shown a sequence of 
260 letters, one letter at a time, with 
instructions to respond by pushing a 
button whenever any one of a set of 
eight predesignated letters appeared. 
After five trials with the same letters, 
Ss were divided into three groups for 
transfer tests. Nine Ss were assigned 
the same letters in a scrambled order 
(“same” group). Nine more Ss were 
assigned an entirely new set of eight 
letters (“different” group). Eighteen 
Ss were assigned a set of four new and 
four old letters (“half” group). 


The results showed: 


1. Complete positive transfer for 
the “same” group. 

2. No evidence of negative transfer 
in the “different” group. 

3. Evidence of negative transfer in 
the carry-over of old responses in the 
“half” group (previously practiced 
letters not assigned on the transfer 
trials). 

4. No significant evidence for posi- 
tive transfer in the “half” group on 
letters previously practiced and still 
assigned. 

The results are interpreted as indi- 
cating the need for assigning a more 
important role in theories of transfer 
to sets induced by instructions. 


(Received March 18, 1952) 
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NONINDEPENDENCE OF SUCCESSIVE RESPONSES IN 
MEASUREMENTS OF THE VISUAL THRESHOLD! 


WILLIAM S. VERPLANCK, * GEORGE H. COLLIER,* AND JOHN W. COTTON ‘ 


Indiana University 


Historically, psychophysics has been 
concerned with two kinds of functions. 
The first kind, that with which we 
shall be concerned here, includes func- 
tions that relate the probability of 
a response (e.g., “yes,” “greater,” 
“equal”’) to the spatial, temporal, and 
energy characteristics of stimuli; these 
are variously termed “psychometric,” 
“threshold,” or, as we shall call them 
here, “Pg” (probability of response) 
functions. The second type includes 
those that relate certain parameters 
(e.g., the abscissa of inflexion or 50% 
threshold and the SD) of functions 
of the first type to some physical di- 
mension, again, of the stimulus. An 
example, in vision, of the first type is 
the function relating log intensity of a 
visual stimulus to the probability of 
response (3, 18). Examples of the 
second type are the visibility (4, 31), 
AI/I (14, 16), and dark-adaptation 
curves (3, 13). 

Operationally, then, the great ma- 
jority of investigations, both basic and 
applied, in the sensory and perceptual 
area have been concerned with first 
obtaining functions of the first type, 
that is, with determining the fre- 
quency of occurrence of some response 
as a function of the physical properties 
of the stimulating situation. The 


1 The experiment reported in this paper was 
performed in the Department of Psychology, 
Indiana University, under ONR Contract N6onr- 
180, T. O. IV, Project 143-253, and with support 
from a grant by the Graduate School of Arts and 
Sciences. * Work is continuing at Harvard Uni- 
versity under Contract N5ori-07639. 

2 Now at Harvard University. 

* Now at Duke University. 

4 Now at Northwestern University. 


quantitative values of the various 
parameters of those functions have 
been treated as isomorphic indicators 
of other processes, such as mental 
events (sensations), events in the ner- 
vous system (nerve fibre discharges or 
additions of “elements’’), or physical 
events at the receptors (absorption of 
certain numbers of quanta of light by 
the retina). The sensory, neural, pho- 
tochemical, or physical processes that 
have been postulated to generate the 
observed functions have been inferred 
from the functions themselves, and 
have seldom, if ever, been available to 
direct or collateral observation. Limits 
have been placed on such inference by 
information obtained in other disci- 
plines. Physiological, biochemical, 
and histological evidence, often scanty, 
has served to keep in check the variety 
of inferences drawn. Thus, the his- 
tological structure of the retina has 
placed limitations on retinal models 
proposed in theories of visual acuity. 
In vision, such men as Hecht (15), 
Wald (30), Crozier (3), and Graham 
(9), have proposed theories concerning 
the physical, photochemical, or neural 
processes responsible for observed 
functions. 

In repeated determinations of Pp 
functions, it is found that the relative 
frequencies of a categorized response, 
such as “‘yes’”’ or “no,” to a series of 
presentations of stimuli varying along 
a single dimension under fixed experi- 
mental conditions (7) converge sto- 
chastically to stable values. It has, 
moreover, usually been assumed that, 
for an intertrial interval greater than 
the “relaxation time” of the eye, ap- 
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proximately .1 sec.,® successive trials 
yield a population of responses that 
are independent events (e.g., 6, 11, 
18, 25, 26, 28). Various distribution 
functions (phi-gamma, Poisson, log- 
normal) have been proposed as relat- 
ing, empirically or theoretically, the 
probability of response to variations in 
the stimulus dimension. As with the 
second class of function, various psy- 
chological, physiological, and physical 
processes have been suggested as gen- 
erating the distribution functions fitted 
to the data, e.g., Guilford’s fluctuating 
attention (10), Crozier’s fluctuating 
thresholds of the excitable elements of 
the central nervous system (3), Hecht’s 
quantal variability of the stimulus 
(18), and perhaps even Hull’s sOpr 
(20). 

One interesting example of a theory 
of the Pg function is that of Hecht 
(18). He found that under his condi- 
tions of stimulation, the amount of 
energy necessary to produce a “yes 
(I saw it)” on 60% of all the occasions 
on which it was presented fell in the 
range 2.0 to 6.0 X 10-” ergs at the 
cornea, or, after correction for reflec- 
tion, transmission, and absorption, 5 
to 14 quanta at the retina. If it is 
assumed with Hecht that it is neces- 
sary for one quantum of light to be 
absorbed by each of 5 to 14 rods if a 
“‘yes” is to occur, then the relation- 
ship of response probability to stim- 
ulus intensity may be generated solely 
in terms of the quantal variability of 
the stimulus. On this hypothesis, 
Hecht derived a Poisson distribution 
function (predicting the absorption of 
n quanta by the rods) for the Pg func- 
tions which he obtained experimen- 
tally. The theoretical curves appeared 


5 A best estimate from the literature on tem- 
poral summation. 

* The assumption of independence is seldom, 
if ever, made explicitly, but it is implicit in the 
statistical methods employed. 


to fit his data very well. One obvious 
corollary of this theory is that, since 
by the basic physical theory, the prob- 
ability of m quanta arriving at the 
retina on any trial is independent of 
what may have occurred on preceding 
trials, then, if the time since the pre- 
ceding stimulation is greater than the 
“relaxation time,” or critical duration, 
the probability of a response is depen- 
dent solely on the delivery, by the 
stimulating flash, of the appropriate 
number of quanta, and is, hence, inde- 
pendent of preceding responses. 

As we have seen, the generation of a 
threshold function involves the re- 
peated presentation of stimuli, and 
the measurement of probabilities of 
response to them. Trials are defined 
and intertrial intervals as well. The 
dependent variable, relative fre- 
quency, or probability of response is 
one which has been employed by many 
behavior theorists. Operationally, a 
“threshold measurement” is not un- 
like the kind of experiment performed 
in studies of learning, performance, 
and endogenous behavior (1, 2, 5, 8, 
21, 22, 23, 27). In view of this simi- 
larity, it seemed likely to the writers 
that many previous formulations must 
necessarily have neglected the opera- 
tion of a number of variables govern- 
ing behavior, and thus must be depen- 
dent for their verification on the use of 
a relatively limited set of values of 
these variables. Hence, too, such 
theories may be incomplete. 

With this point of view, we have 
been concerned with the investigation 
of behavioral variables, such as the 
number of trials, intertrial intervals, 
and “reinforcing stimuli’ on the 
threshold function. This paper reports 
one of a series of such investigations. 

The first indication of the operation 
of behavioral variables in threshold 
measurements appeared in some ex- 
ploratory investigations that yielded 
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data suggesting that statistical sequen- 
tial dependence of successive responses 
may be found with intertrial intervals 
greater than the critical duration, and 
beyond those over which temporal 
summation occurs. The assumption 
of independence of successive responses 
is involved in all attempts to fit theo- 
retically the psychophysical function 
by the normal probability integral, by 
its phi-gamma modification, or by the 
Poisson function, and therefore, to all 
theories of receptor process based on 
such fits. Hartline (12), in as yet 
unpublished research, found that this 
assumption holds for the elaboration 
of single nerve impulses in single fibre 
preparations of the Limulus eye, when 
these are stimulated some 300 succes- 
sive times by a stimulus of fixed and 
low intensity at invariant stimulus 
intervals of 5 sec. or greater. At in- 
tervals of from 2 to 5 sec., the occur- 
rence of a response may reduce the 
probability that a second flash would 
yield a response. 

The present experiment is designed 
to investigate systematically the sta- 
tistical independence of responses to 
successive visual stimulations of the 
human, following a procedure modeled 
after that of Hartline. 


APPARATUS AND PROCEDURE 


Apparatus.—The apparatus used in this ex- 
periment was a Model III Hecht-Schlaer Adap- 
tometer (17). This instrument presents binoc- 
ularly a circular stimulus area subtending 3° at 
the retina, that is presented 7° below a red fixa- 
tion point. The viewing distance is held at 25 
cm. by a headrest. 

A pendulum shutter provides for the presen- 
tation of the illuminated stimulus area to S for 
a period of .2 sec. The wavelength distribution 
of the flash is determined by a glass filter with 
maximum transmittance at 480 mu. Both color 
temperature and intensity of the light source are 
held constant by the use of a fixed filament 
current. The luminance of the stimulus patch 
is continuously variable from 1.5 log yyL to 5.4 
log uuL by means of a neutral optical wedge and 
an auxiliary neutral filter. Calibration is main- 
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tained photometrically. As the instrument was 
used in this experiment, flashes could be pre- 
sented either at fixed mechanically determined 
intertrial intervals, or whenever S pressed a key. 
The apparatus was housed in a sound-deadened 
and light-tight room. 

Subjects —Sixteen paid Indiana University 
undergraduates, 12 men and 4 women, served as 
Ss. All met or exceeded Naval visual stand- 
ards as tested by the Orthorater (20/20 visual 
acuity O.D., O.S.; no phorias; “color normal”). 
All were previously inexperienced in visual ex- 
perimentation, so that complete control of their 
experimental histories was possible.” 

Our Ss remained entirely uninformed of the 
purpose, procedures, and design of the experi- 
ment. They were instructed to maintain fixa- 
tion, and to say “yes” if they saw a flash of light 
when the shutter operated, and “‘no” if they did 
not. When the experimental periods began, each 
S had spent 30 min. in dark-adaptation goggles 
and 10 min. in complete darkness. No special 
treatments distinguished between training, con- 
trol, and experimental days, or among procedures 
of stimulation, except as specified later. 

Procedures—Two procedures of stimulation 
were employed. The first follows the method of 
limits, in which each series, ascending or descend- 
ing, consisted of flashes differing successively in 
steps of .10 log units (LU), presented at 5-sec. 
intervals. Each series was terminated when 
three successive responses (“yes” in ascending, 
“no” in descending series) were made, and the 
next series, in the opposite direction, began with 
luminance .20 LU higher or lower, respectively, 
than the terminal luminance of the preceding 
series. The second procedure is the single lumi- 
nance method. Since the problem to be investi- 
gated in this experiment is the stochastic inde- 
pendence of successive responses, it is necessary 
to eliminate the effect of changing intensities of 


7 Despite the frequent use of highly trained Ss 
in visual-threshold research, we chose to employ 
inexperienced persons. In view of Hecht and 
Shlaer’s report (17) and of our own experience 
that a naive S will begin to make highly reliable 
judgments following a short period of training on 
threshold measurements, no loss of consistency 
of report need be expected. Certain advantages 
are attached to the use of such Ss. When a 
response is suspected to be a function of previous 
behavior in a situation, it is desirable to have 
complete control over, and records of, such be- 
havior, which is possible only with Ss untrained 
prior to the beginning of the experiment. Fur- 
ther, Ss are less likely to divine the purpose of an 
experiment if their previous experience is lim- 
ited, and spurious results, attributable to “sug- 
gestion,” may be precluded. 
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stimulation on the probability of response. The 
method, therefore, consists of the presentation to 
S of a series of 300 flashes of constant luminance. 
The luminance is selected so that the probability 
of a “yes” in response to it by each S will be ap- 
proximately .50. These flashes were presented in 
two ways. On some days, they automatically fol- 
lowed one another at 5-sec. intervals (A). On 
others, S was instructed to present a flash to 
himself whenever he was “ready” for it, by 
pressing a key (S).8 In the latter procedure, the 
interstimulus interval was determined by S, who 
presented stimuli to himself at “optimal” times. 
The Ss were allowed to take a 30-sec. rest when 
they wished. Insofar as possible, all sessions 
were run at the same time of day, on alternate 
days. 

Experimental design—Five preliminary ses- 
sions were given. Each such session consisted of 
ten ascending and ten descending series of the 
method of limits. From the data of each of the 
last four sessions, the luminance yielding a rela- 
tive frequency of “yes” of .50 was determined for 
each S. The mean of these values was taken as 
the best estimate of S’s 50% threshold. These 
preliminary sessions provided each S with a 
standardized practice, and ensured that Ss 
yielded results indicating that they followed the 
instructions, and that were comparable to the 
results usually obtained in threshold measure- 
ments. Four experimental sessions followed. 
Each began and ended with six ascending and 
six descending series on the method of limits. 
The middle period of these sessions was given 
over to the single-luminance method, in which 
each S was presented with 300 flashes; the lumi- 
nance of each of these was set at the 50% thresh- 
old measured in the last four preliminary sessions. 
On two days, the intertrial interval was 5 sec. 
(A); on the other two days, the intertrial interval 
was controlled by S(S). The 16 Ss were divided 
into four groups and each group was assigned 
one of four combinations of the two intertrial 
intervals over the four days of experimentation. 
The first group followed the order AASS, the 
second SSAA, the third ASAS, and the fourth 
SASA. 

One control session was run for six randomly 
selected Ss. This day differed from the experi- 
mental days only in that approximately 60 
“catch” trials were introduced into the series of 
300 single-luminance trials. Since it might well 
be argued that any results might be due to 
“false” responses (“‘yeses” not contingent on 
stimulation), and since the statistical treatment 
forbade that these be introduced on the experi- 


§’This procedure aimed at duplicating the 
method used by Hecht in his experiments on the 
quantum theory of the threshold (18). 


mental days, this final day was considered neces- 
sary. All other conditions were the same as on 
preceding days and Ss were not informed of the 
change in procedure. 


RESULTS 
Method of limits: Thresholds —The 


relative frequency of “yes” responses 
was plotted against log luminance (log 
uuL) on normal-probability paper for 
each of the last four preliminary ses- 
sions on the method of limits. A 
straight line was visually fitted to 
these plots and the threshold (lumi- 
nanceyielding 50% “yes”’ responses) to 
the nearest tenth of a log unit was 
determined by interpolation.® For 
each S a mean threshold for the four 
days was determined. This lumi- 
nance value is presented in the second 
column of Table 1. The average of 
the mean 50% thresholds for all Ss 
was 2.68 log uuL, with an SD of .30 
up. 

Method of limits and single-luminance 
frequencies of response.—Table 1 pre- 
sents the relative frequency of “‘yes”’ 
responses given by each S in each of 
the four experimental sessions. The 
mean of the distribution of differences 
between the .50 (predicted from the 
mean threshold values) and these rela- 
tive frequencies was .065 and the 
sigma .182. Attest of the hypothesis 
that the population mean of this dif- 
ference distribution is zero does not 
permit its rejection (p > 5% with 
t = 1.57 for 15 df). Within the limits 
of these experimental procedures, and 
assuming a normal distribution of 
thresholds, the 50% threshold, deter- 
mined in the method of limits sessions, 
yielded, when employed in the single 


*We do not assume that the relative fre- 
quency of “yes” versus log luminance curves 
employed are normal-probability ogives; this 
type of plot was used because it gives a con- 
venient and reproducible graphical method for 
interpolating the 50% threshold values. It 
works. 
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TABLE 1 
Since Luminance Metuop 
Prelim- D D D Control 
P we: ll ay 1 lay 2 ay 3 Day 4 Senton 
Group | S 
30% RF |N RF | N N RF |N RF Fal 
Threshold “Yes” owl CRr “Yes” wool CRr “Yes” Runs CRR}. Yes’ Runs CRR “Veses™ 
(log ppl) 
1 2.6 MS1(D SSH B14) S29 2 | 7.0) .70 | 28 | 9.6) .03 
AASS 2 2.3 64 | 22 | 9.7| .35 3 |-0.7) .79 | 50 | 4.9) .35 | 65 | 0.8} 05 
“ 3 2.5 65 | 58 | 2.6) .84 | 32 | 3.9) 84] 19 | 9.4] .85 | 30 | 4.0 
4 2.4 50 | 60 | 3.5) .42 | 63 | 3.5) .49 | 60 | 2.4) .42 | 62 | 2.6 .05 
1 2.7 39 | 52 | 4.8) .30 | 31 | 8.8) .39 | 59 | 3.0) 48 | 54 | 48 
SSAA 2 2.7 62 | 64 | 0.4) .61 | 56 | 3.7) .60 | 68 | 0.9) .48 | 70 1.1 
3 2.9 38 | 44 | 6.5) .43 | 48 | 6.0) .53 | 51 | 5.5) .59 | 45 | 6.6] .00 
4 2.8 81 | 14 |12.2} .90 | 18 | 5.5) .90 | 19 | 4.9) .88 | 22 5.1 
1 2.7 45 | 45 | 6.8) .66 | 33 | 8.8] .62 | 41 7.3} .58 | 30 | 10.2 
ASAS 2 2.7 .20 | 28 | 7.3) .50 | 42 | 7.6) .47 | 49 | 6.0) .81 | 17 |11.0} .02 
aed 3 2.2 57 | 73 | O.1| .36 | 52 | 4.3) .20 | 39 | 3.3] .35 | 58 | 2.6 
4 2.7 33 | 42 | 5.7| .19 | 13 | 12.1] .56 | 29 | 10.6] .52 | 19 | 13.0 
l 2.7 64 | 67 | 0.5] .36 | 33 | 9.0] .37 | 26 | 8.3] .12 | 17 | 8.4 
SASA 2 3.0 78 | 36 | 5.4) .63 | 45 | 6.2] .99 3 | 4.8) .87 | 26 | 4.4) .02 
re 3 2.5 34 | 45 | 5.7) .38 | 36 | 8.5) .09 | 15 | 6.0) .35 | 47 | 5.5 
4 3.6 57 | 31 {10.0} .38 | 50 | 5.1] .55 | 41 Tm ae | oe i SS 
Mean 2.68 03 















































luminance method, on the average, 
50% “yes” responses.’ 

False responses; the control session.— 
The last column of Table 1 presents 
the ratio of “yeses”’ to the total num- 
ber of blank trials given. In two 
cases, this approaches 5%; in the other 
four, it is less. This proportion of 
“false” responses is small and cannot 
seriously affect our conclusions, al- 
though, to be sure, one would prefer 
that statistical necessity did not make 
it impossible to evaluate this basal Pz 
in each experimental session. This 


10 This test is valid if the population sampled 
is normally distributed. A chi-square test of 
this assumption did not lead to its rejection here 
(p > 5% with X? = 3.92 for 2 df). Because of 
the small number of df, the probability of a beta- 
type error is large; so that we cannot accept the 
hypothesis of normality with any assurance. 
That the values sampled are means lends weight 
to the hypothesis by the Central Limit Theorem. 


resting rate of response is somewhat 
analogous to an operant level. 
Frequencies of response as a function 
of method of stimulus presentation. 
The relative frequencies of “yes” 
responses for each S were averaged 
separately for the days when the Ss 
stimulated themselves (S), and when 
they were stimulated automatically 
(A). Two corresponding distributions 
of differences from .50 were derived. 
The mean and SD of the A distribu- 
tion were, respectively, .048 and .179, 
and those of the S distribution were 
078 and .208. When the variances 
of the two distributions were com- 
pared, the null hypothesis could not 
be rejected (p > 5% with F = 1.34 
for 15, 15 df). To test the homoge- 
neity of means, a distribution of dif- 
ferences between the mean relative 
frequency of “yes” for the A and S 
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conditions was obtained. The mean 
difference was equal to .033 and the 
SD .169. A test of the hypothesis 
that this mean differed significantly 
from zero did not lead to its rejection 
(p > 5% with t = 0.79 for 15 df). 
Once again, our conclusions with 
respect to homogeneity of variance 
and homogeneity of mean relative fre- 
quencies are subject to the restriction 
that the tests of the normality of the 
distributions are not very good with so 
few df available. 

Summarizing, within the limits of 
the present experimental procedures, 
and the assumption of normal distri- 
butions, the method of limits trials 
yielded 50% thresholds which on the 
average gave 50% “yes” responses on 
the single-luminance trials. The two 
intertrial procedures, self-stimulation 
and automatic stimulation at 5-sec. 
intervals, did not alter significantly 
the percentages of “yeses” given to 
the fixed luminance nor did one con- 
dition result in a more variable per- 
centage of “yeses.” 

Sequential analysis: The serial corre- 
lation from trial to trial (lag 1).—In the 
determination of probability of re- 
sponse functions, a sequence of trials is 
given; in our case, Ss respond with 
“ves (I saw it)” or “no (I did not see 
it)”’ to 300 successive presentations of 
a stimulus patch of invariant bright- 
ness. This procedure yields a time- 
ordered sequence of responses such as 
Yes, Yes, No, No, No, Yes, No, Yes, 
Yes, Yes, . . . . From the relative fre- 
quency of “‘yeses” to all 300 stimula- 
tions, an estimate of the probability of 
a “yes” to any given stimulation, under 
these conditions, can be obtained. 

It has been typically assumed, im- 
plicitly or explicitly, that the successive 
responses, YYNNY .. ., are inde- 
pendent of one another, that is, that 
the probability of obtaining a “yes” 
(or a “no’’) on any trial does not 


depend on the response given to the 
preceding stimulus in the series. From 
this, it follows that the frequency of 
runs of responses (a run is defined as a 
sequence of identical responses pre- 
ceded and followed by a different 
response) of various lengths will occur 
according to certain distribution laws. 
If significantly more runs occur than 
are predicted by these laws (i.e., if S 
alternates YNYNYN .. .), or fewer 
than predicted (YYYNNN .. .), then 
the assumption of independence must 
be rejected, and the data exhibit serial 
dependencies. 

In order to test the hypothesis that 
successive responses are independent 
of one another, we have employed a 
nonparametric serial-correlation test 
described by Hoel (19)."" The serial 
coefficient of correlation, R, in this 
test, as we have used it, is a function 
of the sample size, the number of 
“‘yeses”’ in the total number of re- 
sponses, and the number of “yes” 
runs. R can vary between zero and 
plus infinity. Since the number of 
“‘veses”” varies from session to session, 
and since 7 is here relatively small for 
the computation of serial correlations, 
R’s obtained from different Ss on dif- 
ferent days are not directly compar- 
able. We have, therefore, used the 
critical ratio (CR) for their evalua- 
tion and comparison. This statistic 
has an n (0, 1) distribution. A signifi- 
cant positive CRpr indicates too few 


‘This test is an approximation of one devel- 
oped by Wald and Wolfowitz (29). It is based 
on the cross product term 


R = Lxix; -— 1. 


If +1 is assigned to each “yes” and —1 to each 
“no,” then for 300 responses 


R = 300 — 4(r), 
E(R) = (Si? — 300)/299, 
oR = 300 + (Sit — 1196 Sx? + 89400)/89102 


where: r—number of runs, S; — number of 
“yeses”’? minus the number of “noes.” 
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runs and hence a tendency to repeat a 
response just made; a significant nega- 
tive CR» indicates too many runs and 
hence a tendency to alternate. The 
results of this analysis are presented 
in Table 1, which presents both the 
number of runs and the CR» for each 
experimental session. 

Of the sixty-four CRp’s (16 Ss over 
four days) calculated, only seven failed 
to reach the 5% level of significance. 
The distribution of CRp’s, averaged 
over all days for each S, gave a mean 
of 5.87 and a sigma of 2.25. The 
probability of this distribution of 
CRp’s” on the assumption of M = 0 
and o = 1, is very much less than 1% 
(p < 1% with X* = 631.55 for 15 df) 
and (p < 1% with X* = ns?/o? = 
33.75 for 15 df). Serial correlation is 
significantly present. 

A comparison between the auto- 
matic- and self-stimulation conditions 
was made by subtracting the average 
CR» obtained with automatic stimu- 
lation at 5-sec. intervals from the 
average CR» obtained with self-stimu- 
lation. The mean of this distribution 
was 1.22, and the sigma 2.33. A test 
of the hypothesis that the mean differ- 
ence was zero could not be rejected 
(p > 5% with t = 2.03 for 15 df). 

A test of the homogeneity of vari- 
ance between the distribution of the 
mean CRp’s (over two days for each 
S) for self- and automatic-stimulation 
conditions again does not permit us to 
reject the null hypothesis (p > 5% 
for F = 1.88 with 15, 15 df). For 


12 Again, this test assumes a normal distribu- 
tion of CRr’s. Again, there are insufficient de- 
grees of freedom to make a reliable test of this 
hypothesis for the present data. However, evi- 
dence from other. experiments indicates that the 
distribution, of CRp’s is normal, with a mean 
different from zero and a sigma different from 
one in cases where significant serial correlation is 
present. A nonparametric sign test of the same 
hypothesis on the present data yields a probabil- 
ity between 5 and 10%. 
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further analyses the two sets of data 
were combined. 

We may conclude that there is a 
significant serial correlation under 
both experimental conditions, and 
that there is no significant difference 
between the two conditions in the de- 
gree of serial correlation, as measured 
by the critical-ratio test. All but one 
of the CRp»’s are positive, that is, runs 
are longer than expected on the hy- 
pothesis of a random sequence of 
responses. Subjects tend to repeat 
responses that they have just made. 
Serial dependencies are exhibited. 

Sequential analysis: Serial correla- 
tions at lags greater than one.—The 
demonstration of dependent proba- 
bilities between successive responses 
may suggest cyclical or long-range 
effects of some sort. These should 
reveal themselves in the form of sys- 
tematic changes in the index of serial 
correlation as the temporal interval 
between responses correlated increases. 
In order to investigate this possibility, 
serial-correlation coefficients at lags 
greater than one may be computed. 
That is, the correlation of the first 
response with the fourth, the fourth 
with the seventh, the seventh with the 
tenth, and so on (Lag 3) may be deter- 
mined. Thesecomputations have been 
made on the present data at Lags 3, 5, 
7, 9, and 20. If cyclic or regular 
changes in response probability occur, 
then the CR of the serial correlation 
coefficients should show corresponding 
changes. In Figure 1 are presented 
the mean critical ratios (over days and 
Ss) plotted against lag. Up to Lag 
20, the critical ratio appears to be a 
monotonic decreasing function of the 
lag. Long-range effects appear, but 
this analysis shows no evidence of 
orderly cycles. 

Examination of individual records, 
made by plotting subsets of a single 
S’s data on a given day on binomial- 
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probability paper (24) also reveals 
no effects at lags greater than 20. We 
do not find statistically significant 
lack of independence beyond about 
Lag 11, which, under our condition 
“automatic stimulation,” covers about 
1 min. 


Discussion 


Successive responses were found to 
be dependent upon one another under 
all conditions of this experiment. 
Runs of “‘yeses” and “‘noes”’ of length 
greater than expected on the hypoth- 
esis of independence occur. If S re- 
ports “‘yes” on one flash, he is more 
likely to report “yes” on the next. 

Furthermore, no statistically sig- 
nificant differences can be exhibited 
between either the percentage of the 
stimuli seen, or the dependent proba- 
bilities of successive responses when 
the stimulations are presented auto- 
matically, at a fixed 5-sec. interval, or 
when they are presented to S by him- 

18 This statistical tool exhibits serial depen- 


dencies in a single session’s data rapidly. We 
used it extensively in preliminary analyses. 


self when he is “ready.” Further 
work is necessary before intertrial in- 
terval can be considered to be an irrele- 
vant variable, since our Ss may have 
stimulated themselves at a rate close 
to the automatic one. 

Statistical nonindependence may be 
produced in either of two ways. First, 
the probability of a particular response 
may depend upon the identity of the 
previous response. Second, some as 
yet unidentified “‘state” variables may 
alter the probabilities of responses on 
trials close to one another in time— 
that is, there may be trends, cyclical 
or otherwise, in a time-ordered series, 
although its members are not depen- 
dent upon one another. Four cate- 
gories of variables suggest themselves 
as possibly responsible for such effects: 
(a) stimulus variables; (b) nonstimulus 
variables independent of preceding 
responses, of response history, and of 
preceding stimulation; (c) nonstimulus 
variables independent of preceding 
stimulation; (d) nonstimulus vari- 
ables dependent on preceding response 
history. 

Category (a) includes variables such 
as fluctuations in the intensity of the 
stimulating light, continuous versus 
discontinuous presentation of the fixa- 
tion point, concurrent stimulation 
(shutter noises, etc.), and so on. 
Fluctuations of the intensity of the 
stimulating light were controlled by 
our calibration and regulatory appa- 
ratus and cannot be considered a 
contributing variable. Extraneous 
stimulation was controlled by the use 
of a sound-deadened room, removal of 
relays to other rooms, and shutter- 
masking noises. 

Category (b) includes such variables 
as physiological state (blood pressure, 
etc.), changes in pupillary diameter, 
changes in the retinal site of stimula- 
tion owing to fluctuations in fixation, 
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gross motivational fluctuations that 
are not a function of the immediate 
stimulating conditions, changes in the 
criterion of the S, and so on. 

Category (c) includes such variables 
as adaptation, summation, changes in 
pupillary diameter produced by pre- 
ceding stimulation, fluctuations in the 
place of stimulation dependent on pre- 
ceding stimulation, etc. Adaptation 
is probably not a contributing vari- 
able, since its effect would be to reduce 
sensitivity, thereby decreasing the 
probability of response following a 
supraliminal flash, and so yielding neg- 
ative serial correlations. Retinal sum- 
mation over a five-sec. intertrial 
interval has never been reported to our 
knowledge. 

Category (d) includes such variables 
as conditioning (i.e., an increment to 
the probability of response produced 
by the occurrence of some reinforcing 
event after a preceding response), 
peripheral or central “orientation” be- 
havior dependent upon preceding 
response history, such as might result 
from preliminary training on the 
method of limits, motivational factors 
that are a function of the immediately 
preceding response history, and so on. 
At the present time, reports of experi- 
ments on the effects of many of these 
variables are in preparation. Other 
variables are still under investigation. 

In any event, these results have a 
number of implications, not only for 
theories about the visual (and other) 
thresholds, but also for all procedures 
which investigate the sensitivity of the 
intact human. Most particularly, 
theories of the threshold which either 
assume or predict independence may 
need revision, and conversely, some 
account of the variables which produce 
nonindependence must be made. The 
results raise other questions as well: 
What, for example, might be done to 
keep S at peak responsiveness? 
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SUMMARY AND CONCLUSIONS 


This experiment was performed in 
order to determine whether the suc- 
cessive responses of S given in measure- 
ments of his visual threshold are sta- 
tistically independent of one another. 
This is implicit in several theories of 
sensory thresholds. Sixteen Ss were 
presented, on four successive days, 
with 300 consecutive stimuli of a lumi- 
nance which had been found previously 
to be reported 50% of the time. On 
two days these stimuli were presented 
automatically at 5-sec. intervals. On 
the other two, S presented them to 
himself. Critical ratios of serial-corre- 
lation coefficients were computed on 
the time-ordered sequences of responses 
at Lags 1, 3, 5, 7, 9, 11, and 20. 

1. Under the conditions of this 
experiment the mean 50% absolute 
visual threshold for each subject elic- 
ited, on the average, 50% frequencies 
of responses on immediately subse- 
quent days. 

2. When S is presented with a dis- 
crete, dim flash of light, his response to 
it will depend not only upon the lumi- 
nance of the flash, but also, among 
other things, on how he has responded 
to preceding flashes of light of the 
same luminance. Each response is 
dependent upon previous responses, or 
perhaps both are dependent upon a 
third variable which varies in time. 

3. This nonindependence is exhib- 
ited with statistical significance when 
as many as 10 responses intervene 
between the correlated responses. 
The value of CRr seems to be a 
monotonically decreasing function of 
the lag. 

4. No statistically significant dif- 
ferences appeared between the data 
obtained under the two conditions of 
stimulation. 


(Received for priority publication, May 19, 1952) 
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FACTORS RELATED TO SPEED AND ACCURACY 
OF TACTUAL DISCRIMINATION ! 


T. R. AUSTIN AND R. B. SLEIGHT 
The Johns Hopkins University 


In a previous experiment (1), speed 
and accuracy of tactual discrimination 
were tested for 43 figures. These in- 
cluded the upper-case letters A through 
Z, the numerals 2 through 9, and nine 
geometric forms. Results showed that 
25 figures—or approximately 60% of 
the total number of stimuli—met or 
exceeded a 90% accuracy criterion. 
These results pertained to a solid set 
of figures when finger-tip movement 
was permitted. One implication of 
the study seemed to be that such tac- 
tual cues might be applied to instru- 
ment panels, dashboards, machinery 
switches, etc. 

The experiment described above was 
exploratory in nature. Its main pur- 
pose was to determine, in fact, if the 
finger tips could be employed in dis- 
criminating these types of figures. The 
positive evidence warranted further 
investigation, and this paper reports 
the results of a subsequent study. 

If tactual cues are to be utilized, it 
is appropriate to determine whether 
additional factors play a part in S’s 
ability to discriminate. The purpose 
of this experiment was, therefore, to 
determine if speed and accuracy of 
tactual discrimination were related to 
or affected by differences in sex, hand- 
edness, fingers employed in making 
the discriminations, learning, pressure 
exerted by the finger tip, and levels of 
subjective confidence. 


1This research was supported in part by 
Contract N5-ori-166, Task Order I, between the 
Systems Coordination Division, Naval Research 
Laboratory, Office of Naval Research, and The 
Johns Hopkins University. This is Report No. 
166-I-155, Project Designation No. NR-784-001, 
under that contract. 


APPARATUS 


The set of figures tested was selected from the 
solid group of figures used previously. These 
figures were cut from }-in. Masonite sheeting and 
were the maximum size which could be inscribed 
in 4-in. circles. Stroke width was held constant 
at2mm. Some slight alterations from common 
forms were made in the figures when such altera- 
tions enabled the over-all area of the figures to 
be increased within the 4-in. circles. The figures 
tested were the letters C, D, E, G, I, J, K, L, 
O, P, Q, T, U, V, W, X, and Y, the numerals 2, 
7, and 9, and the following geometric forms: 
diamond, right triangle, rectangle, crescent, and 
half-circle. These figures are the ones which met 
or exceeded a 90% accuracy criterion in the first 
study. 

The figures were mounted on a plywood disk 
which could be revolved 360° in the horizontal 
plane to expose one figure at a time through a 
slot in the handrest. The pivot pin in the center 
of the disk was secured to a lever which was 
displaced in the vertical plane by S’s pressure on 
the figure. The movement of the lever was 
geared to a synchro-generator, and the latter 
was electrically wired to a vacuum tube volt- 
meter. By means of this arrangement, the 
deflection of the voltmeter was linearly propor- 
tional to the amount of pressure exerted on the 
figure by the finger tip, and the voltage could 
easily be transformed into pressure units. 

The slightest pressure by S’s finger closed a 
microswitch which activated a clock in a voice 
reaction timer. When S responded, the timer 
was stopped by means of a throat microphone. 
The apparatus produced highly accurate meas- 
ures of reaction time and pressure exerted by the 
finger tip at the instant of discrimination. 


PROCEDURE 


The Ss were seated at a table on which the 
equipment was placed. The distance from S to 
the table was adjusted so that all discriminations 
were made with partially extended arms. The 
Ss were told that the task was to discriminate 
letters, numerals, and geometric forms, but were 


2 The authors wish to express their thanks and 
appreciation to Mr. T. W. Hollis, Jr., who de- 
signed and built the tactual discrimination 
apparatus. 
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TABLE 1 


OrperR oF Fincers TesTep For 16 Ss In A 
Muttiete Latin-Square DeEsIcn 
After the four initial trials, each S was tested in 
four additional trials with the finger 

employed first in his sequence. 

Roman numerals in the matrix 

of this table correspond to the 

fingers employed in each 
specific trial. 











Ss Trial 
Hand- 
Sex edness 1 2 3 4 5-8 








M | Left III | Il IV I Ill 





F | Right | II IV | Ill I II 





F | Left II I IlI | IV Il 























not told the number or relative frequency of each 
type. Instructions were as follows: 

“Place your hand in position and at the signal 
from the experimenter lower the specified finger 
on the figure. You may move your finger at 
will. Respond as soon as you think you are sure 
you know what the figure is. Speed is important 
but accuracy is of primary importance, so make 
sure of your discrimination before responding. 
After each response, indicate the confidence of 
your discrimination on a five-point scale.” (The 
five-point scale was thoroughly explained by £.) 
“Remember, accuracy should be your primary 
aim in each discrimination.” 

Accuracy was emphasized in the instructions 
as an attempt to guard against Ss’ responding 
too rapidly and not as accurately as they might. 
Reaction time, exerted pressure, and level of 
confidence were recorded for each response. The 
Ss were not blindfolded but were instructed not 
to look at the apparatus at any time throughout 
the experiment. They followed this instruction 
without exception. None of the Ss had previous 


experience with this type of tactual discrimi- 
nation. 

Sixteen Ss (eight men, four right-handed and 
four left-handed; and eight women, four right- 
handed and four left-handed) were tested on 
their four right-hand fingers (thumbs excluded). 
Handedness was established by asking each S 
questions from the Purdue Speech Clinic Later- 
ality Index. The order of testing fingers of each 
S was according to a multiple latin-square design 
as illustrated in Table 1. It is seen that none of 
the finger sequences was repeated. 

After each S had four trials, one with each 
finger in the specified latin-square sequence, he 
was given four additional trials with the finger 
which was tested first in each sequence. These 
additional trials were incorporated to enable an 
analysis of learning trends to be made. Each 
trial consisted of 25 discriminations, one for each 
figure in a random order. A different order was 
used for each S for each trial. 


RESULTS 


Accuracy.—Accuracy and reaction 
time were held to be the most impor- 
tant factors in the investigation. The 
increase in accuracy over the eight- 
trial period is shown in Fig. 1. Mean 
per cent accuracy increased from 
87.36% (Trial 1) to 100% (Trials 7 
and 8). Each mean is the average of 
400 discriminations (16 Ss, 25 discrim- 
nations per S). 

Reaction time.—In Fig. 2, mean reac- 
tion time is plotted as a function of 
successive trials. It is seen that dur- 
ing the eight-trial period, mean reac- 
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Fic. 1. Increase of mean per cent accuracy 
during the eight experimental trials. Each point 
is the mean of 400 discriminations (16 Ss, 25 
discriminations per S). 
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TACTUAL DISCRIMINATION 


tion time decreased from 9.11 sec. 
(Trial 1) to 3.52 sec. (Trial 8). Cor- 
responding to the steady decrease in 
reaction time was the variability of 
reaction-time scores. The SD’s de- 
creased from 2.29 on Trial 1 to 1.59 
on Trial 8. 

Confidence.—Although confidence 
never reached a mean five level (per- 
fect confidence), it increased through- 
out the range of trials. Figure 3 
expresses this relationship, and it may 
be seen that mean confidence increased 
from 3.99 on Trial 1 to 4.92 on Trials 
7 and 8. 

Pressure.—The pressure exerted by 
the finger tip at the time of each dis- 
crimination was read directly from the 
voltmeter by means of a special scale 
calibrated in ounces. Pressure was 
recorded by scoring a “1” for all dis- 
criminations effected with zero to 3 oz. 
of pressure, “2” for the 3- to 6-oz. 
interval, etc. In only 30 responses 
from a total of 3200—less than 1%— 
was the zero- to 3-oz. interval ex- 
ceeded. 

Sex, handedness, and fingers.—These 
factors obviously did not contribute a 
major source of error variance since 
accuracy reached the 100% level for 
all Ss tested. In addition, a simple 
analysis of variance was performed on 
the reaction-time scores of the first four 
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Fic. 2. Decrease of mean reaction time as a 
function of successive trials. Each point is the 
mean of 400 discriminations. 
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Fic. 3. Increase of mean confidence over the 


eight-trial period. Each point is the mean of 
400 discriminations. 


trials of each S to determine whether 
sex, handedness, or fingers differed 
significantly in terms of reaction time. 
Resulting variances for all three fac- 
tors were small enough to be neglected. 
Both accuracy and reaction time data, 
therefore, strongly suggest that there 
are no differences between males and 
females, left- and right-handed people, 
or any of the four right-hand fingers 
in ability to tactually discriminate 
these figures. 

Subject analysis.—All the above re- 
sults have been presented in terms of 
trial-by-trial analysis in order to ex- 
press the relationships between speed 
and accuracy and the various other 
factors over a series of repeated trials. 
It remains to compare speed, accu- 
racy, and confidence of the 16 indi- 
vidual Ss. 

Three product-moment correlations 
were computed: between reaction time 
and per cent accuracy, between reac- 
tion time and confidence, and between 
per cent accuracy and confidence. For 
each correlation there were 16 paired 
scores, one pair for each S. Each 
score was the mean of the 25 separate 
discriminations for each of the first 
four trials taken together. There- 
fore, a score used in calculating a cor- 
relation was the average of 100 dis- 
criminations. Only the first four 
trials were employed because beyond 
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this point there was very little disper- 
sion in reaction-time and confidence 
scores, and no dispersion in accuracy 
scores. It will be appreciated, how- 
ever, that due to the combination of 
repeated trials on the same stimulus 
materials, the correlated scores are 
comprised of both initial level of S 
performance and subsequent values 
which are influenced by rate of learn- 
ing. These correlations were com- 
puted to answer the following ques- 
tions: (a) Did Ss who responded 
accurately also respond quickly; (0) 
did Ss who responded quickly do so 
with high confidence; and (c) did Ss 
who responded accurately do so with 
high confidence? 

The product-moment r between 
mean reaction time and mean per cent 
accuracy was —.70. This r shows a 
significant relationship between short 
reaction time and high accuracy, and 
reciprocally, long reaction time and 
low accuracy, at beyond the 1% level 
of confidence. 

For the correlations involving con- 
fidence scores, only the mean number 
of “five” confidence reports was em- 
ployed. In other words, mean perfect 
confidence was correlated with reac- 
tion time and per cent accuracy. 

When the correlation between mean 
reaction time and mean confidence 
was calculated, the data yielded an r 
of —.94, indicating to a marked de- 
gree that high confidence was accom- 
panied by short reaction-time scores, 
and low confidence was accompanied 
by long reaction-time scores. 

The correlation between mean con- 
fidence and mean per cent accuracy 
resulted in an r of .47 (significant only 
at the 10% level). Prediction of ac- 
curacy from confidence scores, there- 
fore, would be less dependable than 
prediction of reaction time from con- 
fidence scores. 


Discussion 


Learning.—The learning that re- 
sulted from the repeated trials is 
apparent from an analysis of speed 
and accuracy scores. Both of these 
measures indicated that a great deal 
of learning did take place during the 
experimental sessions. Mean reaction 
time for the 25 figures on the first trial 
was shown to be 9.11 sec., and the 
corresponding mean accuracy was 
87.36%. Seven trials later, mean 
reaction time had decreased to 3.52 
sec., and mean accuracy had reached 
the 100% level. Accompanying these 
trends was a steady decrease in vari- 
ability of the reaction-time scores. 


Equipment operation.—When the speed and 
accuracy data are considered along with the 
evidence that sex, fingers, and handedness are 
nonsignificant factors in these discriminations, 
several implications of importance for selection 
and training problems may be seen. Any piece 
of equipment employing tactual cues would 
likely have to be standardized, i.e., be located in 
the same position, made of the same design and 
materials, etc., as demanded by production 
methods. These results indicate that males and 
females, and left- and right-handed operators are 
equally proficient in the utilization of the cues. 
Therefore, for example, the tactual array might 
always be situated to the right-hand side of any 
operator. Further, the results indicate that all 
four right-hand fingers may be employed with 
equal efficiency. This would suggest that all 
four fingers could be used on a battery of push 
buttons, thus enhancing the versatility of the 
array. The most important aspect of the results, 
however, is the fact that all persons can reach 
the 100% accuracy level and greatly diminish 
their reaction time in short periods of training. 
Because eight trials did not seem to allow suffi- 
cient practice for reaction-time scores to reach 
asymptotic values, it may be assumed that 
further training would increase speed still more 
while accuracy would remain at the 100% level. 

Confidence.—In general, confidence increased 
as accuracy increased and reaction time decreased 
throughout the eight trial sessions. Such an 
increase in confidence is to be expected with 
familiarity with the figures. 

An unexpected result was the high correlation 
between confidence and reaction time and the 
considerably lower correlation between confi- 
dence and accuracy in the subject analysis. 
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TACTUAL DISCRIMINATION 


What S is willing to state in the way of confidence 
is more highly correlated with how fast he is 
willing to state it than the accuracy with which 
the discrimination is made. Regardless of the 
low correlation between accuracy and confidence, 
however, each of the 16 Ss reached the 100% 
level of accuracy by his seventh trial. It there- 
fore seems that confidence was not a basic cri- 
terion for these discriminations, especially when 
accuracy was the most strongly emphasized 
factor. 

Pressure.—This measure was the most highly 
consistent and possibly one of the most impor- 
tant of those recorded. Approximately 99% of 
the discriminations were made by exerting be- 
tween zero and 3 oz. of pressure. This consis- 
tency is important when considered in terms of 
equipment design. If tactual cues should be 
used on push buttons, and if the push buttons 
are to be spring loaded, then the finger tip must 
depress the spring as well as afford a discrimi- 
nation. The consistency of pressure measure- 
ments in this experiment, both for different Ss 
and different fingers, suggests that a common 
standardized spring might possibly be built into 
all push buttons. 


SUMMARY AND CONCLUSIONS 


The purpose of this experiment was 
to determine if speed and accuracy of 
tactual discrimination were related to 
or affected by differences in sex, hand- 
edness, fingers employed in making 
the discriminations, learning, pressure 
exerted by the finger tips, and levels of 
subjective confidence. 

The stimulus figures utilized had all 
met or exceeded a 90% accuracy crite- 
rion in a previous experiment. The 
figures used were 17 upper-case letters, 
three numerals, and five geometric 
forms. 
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Sixteen Ss (eight men, four right- 
handed and four left-handed; and 
eight women, four right-handed and 
four left-handed) were tested. 

The results of the experiment seem 
to warrant the following conclusions: 

1. A great deal of learning took 
place during a short training session 
as shown by a marked increase in 
accuracy and decrease in reaction 
time. 

2. Levels of subjective confidence 
increased as accuracy increased and 
reaction time decreased throughout 
the eight-trial period. 

3. The product-moment correlation 
between mean per cent accuracy and 
mean reaction time was —.70; between 
mean confidence and mean reaction 
time —.94; and between mean confi- 
dence and mean per cent accuracy .47. 

4. In approximately 99% of the dis- 
criminations, pressures from zero to 
3 oz. were exerted in making the 
discriminations. 

5. Nosignificant differences resulted 
from discriminations made by left- 
versus right-handed Ss, males versus 
females, or any of the four right-hand 
fingers. 


(Received for priority publication 
June 20, 1952) 


REFERENCES 


1. Austin, T. R., & Steicut, R. B. Accuracy 
of tactual discrimination of letters, nu- 
merals, and geometric forms. jJ. exp. 


Psychol., 1952, 43, 239-247. 














FUNCTIONAL FIXEDNESS AS RELATED TO PROBLEM 
SOLVING: A REPETITION OF THREE 
EXPERIMENTS? 


ROBERT E. ADAMSON 


Stanford University 


The study of problem solving and 
thinking has been retarded by the lack 
of agreed-upon theoretical concepts 
supported by adequate data from ex- 
periments. As a part of a larger pro- 
gram concerned with these matters, 
some of the more promising hypotheses 
have been assembled, and preliminary 
experiments undertaken to repeat the 
demonstrations upon which these hy- 
potheses rest. 

One inviting hypothesis is that 
problem solving may in some instances 
be delayed through the “functional 
fixedness” of solution objects. That 
is, owing to his previous use of the 
object in a function dissimilar to that 
demanded by the present problem, S 
is inhibited in discovering the appro- 
priate new use of the object. This 
hypothesis was proposed by Duncker 
(3), who designed ingenious experi- 
ments to support it, but carried the 
experiments through with but 14 Ss 
and under poorly specified experimen- 
tal conditions. It seemed wise, there- 
fore, to repeat some of his experiments 
both to substantiate his results, if pos- 
sible, and to ascertain the efficacy of 
the problems for use in further investi- 
gations. The success of Birch and 
Rabinowitz (1) in demonstrating func- 
tional fixedness in a related experiment 


1 This experiment was the first in a series of 
studies of problem solving being done under 
Project NR 150-104 and supported by Contract 
Nonr 225 (02) between Stanford University and 
the Office of Naval Research. The work was 
done and the present report prepared under the 
supervision of Dr. Donald W. Taylor. Work on 
the contract is under the general direction of 
Dr. E. R. Hilgard. 


encouraged us to hope for positive 
results. 


PROCEDURE 


Subjects —All Ss taking part in this study 
were college students from elementary psychol- 
ogy classes. There were 57 Ss, of whom 35 were 
men, 22 women. Twenty-nine Ss were assigned 
to the experimental group, 28 to the control 
group. All Ss were of proximate ages and had 
been exposed to little experimentation. 

Problems.—Duncker’s “box,” “gimlet,”’ and 
“paperclip” problems were presented to each S 
in the order named. In the first of these, the 
box problem, S’s task is to mount three candles 
vertically on a screen, at a height of about 5 ft., 
using to accomplish this task any of a large num- 
ber of objects which are lying before S on a 
table. Among these objects are three pastc- 
board boxes of varying sizes, five matches, and 
five thumbtacks, the crucial objects for solution 
of the problem. The solution is to mount one 
candle on each box by melting wax on the box 
and sticking the candle to it, then to tack the 
boxes to the screen. 

The gimlet problem involves suspending three 
cords from a board attached to an overhead 
beam. Among the variety of objects available 
are two screw-hooks and the gimlet itself, the 
objects from which the cords may be hung. 

The paperclip problem consists of first attach- 
ing four small black cardboard squares to a large 
white square, then hanging the large square from 
an eyelet screwed into the aforementioned beam. 
Included among the objects lying before S on the 
table are a number of paperclips. These may 
be used to attach the small squares to the large 
one, and one of them, when bent to form a hook, 
will serve to hang the large square from the 
eyelet. 

Design.—The experimental and control groups 
were given the same problems to solve. For the 
experimental group, however, at least one of the 
solution objects was “burdened” with a prior 
function in each problem. Thus, the candles, 
matches, and tacks for the box problem were 
placed in the three boxes before they were given 
to S. Hence, the boxes had for their initial func- 
tion that of containing, whereas in their solution 
function they had to be used as supports or 
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platforms. Similarly the gimlet initially had to 
be used to start holes for the screw-hooks, and in 
the paperclip problem, the four black squares had 
to be attached to the white one with paperclips. 
Duncker referred to the experimental group as 
the “after pre-utilization” group. 

The control group was given the problems 
without any pre-utilization. In the case of the 
box problem, the empty boxes were placed on the 
table at varying distances from the other crucial 
solution objects. Holes into which the screw- 
hooks and the gimlet could be screwed were 
already drilled into the beam in the case of the 
gimlet problem; the four black squares were 
stapled to the white one in the paperclip problem. 
Thus, none of the crucial objects was used with a 
function prior to its use as a solution object. 

Solution scores were taken as one possible 
measure of functional fixedness, and time-to- 
solution constituted another measure. A maxi- 
mum time of 20 min. was allowed for solution of 
each of the problems. 


RESULTS 


Box problem.—The results of the box 
problem, presented in Table 1, confirm 
Duncker’s finding that functional fix- 
edness results from pre-utilization. 
The performance of the experimental 
group was markedly inferior to that of 
the control with respect both to the 
number of solutions obtained and the 
time required to reach solution. Prior 
usage of the boxes as containers inhib- 
ited their being used as platforms. 

The chi-square value comparing the 
two groups on the time score was 
obtained by using as a cutting point 
the median time-to-solution of the 
combined groups. All cases for which 
there was no solution were assigned to 
the above-median category. With 1 
df, each of the chi squares was highly 











significant. Since the direction of the 
TABLE 1 
Box ProsLemM 
Group | | Number Solving | Time-to-Solution* 
Exper. | 29 12 (41%) 7 (24%) 
Control | 28 24 (86%) 22 (78%) 


x? = 12.0p = .001 








x? = 14.8) = .001 








* Number below median of combined group. 
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TABLE 2 
GmLet AND Paperciip Prospiems 
Mean 
Problem - 
Time-to- 
and ; SD v * 
Group | " | Squution c 
Gimlet 
Exper. | 26 | 246.6 | 124.7 
Control| 28 | 144.0 | 67.7| 3:71 | -001 
Paperclip 
Exper. | 29 | 107.9 | 96.0 
Control| 28 | 63.0 | 31.5| 298 | 




















* Single-tail test. 


difference was predicted, a one-tail 
test of significance was employed for 
both this and the following two 
problems. 

Gimlet problem.—Since only three 
Ss failed to solve this problem, all from 
the experimental group, the solution 
score could not demonstrate a differ- 
ence between the experimental and 
control groups. Accordingly, only the 
results from the time measure are 
given in Table 2. The three Ss failing 
to reach solution were not considered in 
the analysis of the data, thus reducing 
the total m to 54 for this experiment. 

Since, as shown in Table 2, the vari- 
ances for the two groups are not homo- 
geneous, the use of t as a test of signifi- 
cance was inappropriate. Instead, ¢’ 
was employed.? The highly signifi- 
cant difference obtained shows clearly 
the presence of functional fixedness. 

Paperclip problem.—The results 
from the paperclip problem are also 
shown in Table 2. Since all Ss were 
able to solve this problem, only time 
scores are given. As in the first two 
problems, pre-utilization of the solu- 


? This technique was suggested by Dr. Quinn 
McNemar. Instead of utilizing one estimate of 
the population variance, t’ incorporates the esti- 
mated variances from two populations (2). It 
is, in consequence, useful in such a situation as 
the present one, replacing t which assumes homo- 
geneity of variance. 
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tion objects with a function different 
from that demanded by the problem 
resulted in significantly poorer per- 
formance by the experimental group. 

Reliability of individual perform- 
ance.—An analysis was made of the 
performance of the experimental group 
to determine whether individual 
achievement on one of the three prob- 
lems was significantly related to 
achievement on either of the other 
two. Chi square was used to test 
whether individuals scoring below the 
median in time-to-solution for one of 
two problems also showed a significant 
tendency to score below the median 
for the other. (Since only 12 of 29 Ss 
solved the box problem, a median 
time-to-solution could not beobtained; 
instead, the distribution was dichot- 
omized in terms of solution or no- 
solution.) A relation significant at 
the .05 level was found between the 
box problem and the paperclip prob- 
lem. Neither of the other chi squares 
was significant. Clearly, achievement 
on a single problem involving pre-utili- 
zation is not a reliable measure of 
individual susceptibility to functional 
fixedness. 


Discussion 


Duncker’s study (3), involving these 
three experiments, used two measures 
of performances number of presolu- 
tions, and number of solutions. In the 
present study, the number of pre- 
solutions was discarded as a measure, 
because it was found to be overly 
dependent upon the subjective judg- 
ment of £. Number of solutions 
proved to be a satisfactory measure 
for only the box problem. Since all Ss 
solved the paperclip problem, and all 
but three solved the gimlet problem, 
no difference between the experi- 
mental and the control groups could 
be revealed by this measure. It would 
appear that the Ss in the present study 


ADAMSON 


were more able than those employed 
by Duncker. 

Although the measures previously 
used by Duncker failed to show func- 
tional fixedness in two of the three 
present experiments, a new measure, 
time-to-solution, gave positive results 
in all three experiments. Essentially, 
then, the present results confirm those 
obtained by Duncker. 

The results of Duncker, of Birch 
and Rabinowitz, and those obtained 
in this study afford convincing proof 
of the existence of functional fixed- 
ness. The reality of this phenomenon 
having been established, two lines of 
investigation are of immediate inter- 
est: (a) determination of those condi- 
tions which influence the occurrence 
of functional fixedness, and (b) explo- 
ration of its relation to other kinds of 
set in problem solving. A study now 
nearing completion involves both of 
these lines of experimentation. 


SUMMARY 


1. Three of Duncker’s experiments 
on functional fixedness were repeated 
in this study. Fifty-seven Ss were 
used, 29 serving as the experimental 
and 28 as the control group. Both 
groups were given the “box,” “gim- 
let,” and “paperclip” problems in that 
order. Experimental Ss were given 
each problem after first having used 
the solution objects for that problem 
in a function dissimilar to that de- 
manded for solution. Control Ss were 
given the problems without such pre- 
utilization. 

2. Two measures of performance 
were used: number of solutions, and 
time-to-solution. The former meas- 
ure discriminated between the experi- 
mental and control groups on only the 
box problem; the latter measure gave 
highly significant differences in the ex- 
pected direction for all three problems. 














PROBLEM SOLVING 


3. Functional fixedness was shown 
to result from the pre-utilization of 
solution objects. Duncker’s results 
were confirmed in a study using a 
larger m and having more carefully 
specified experimental conditions. 


(Received for priority publication 
June 26, 1952) 
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SIMULTANEOUS VISION AND AUDITION: THE 
DETECTION OF ELEMENTS MISSING FROM 
OVERLEARNED SEQUENCES! 


G. H. MOWBRAY 


Cambridge University 


As transportation by land, sea, and 
air has become swifter and more com- 
plex, the necessity for communicating 
more information in shorter periods of 
time has arisen. This has led quite 
naturally to a critical evaluation of 
existing channels of communication 
and an extended search for new me- 
diums to exploit. As an outgrowth 
of the mounting interest displayed in 
the development of visual communica- 
tion systems, Cheatham (2) has sur- 
veyed the pertinent psychological 
literature on vision and audition in 
order to compare these two sense 
modalities as communication chan- 
nels. Harris (4), in an earlier report, 
has considered at some length the rela- 
tions between vision and audition 
from the standpoint of a dimensional 
comparison rather than a direct one 
based on a specific laboratory experi- 
ment. The numerous attempts to 
discover whether auditory or visual 
presentation of information is more 
efficient have recently been reviewed 
by Day and Beach (3). These auth- 


ors abstracted 11 generalizations from 


1 This research was supported in part by Con- 
tract N5-ori-166, Task Order I between the 
Systems Coordination Division, Naval Research 
Laboratory, Office of Naval Research, and The 
Johns Hopkins University. This is Report No. 
166-I-121, Project Designation No. NR-507-470, 
under that contract. The actual experimental 
work was done at The Psychological Laboratory, 
University of Cambridge, Cambridge, England 
and the author is grateful to Professor Sir 
Frederic Bartlett for the facilities and assistance 
provided at that laboratory. Thanks are also 
due to Dr. W. R. Garner of The Johns Hopkins 
University for his helpful criticisms and sug- 
gestions. 


the literature which specify the condi- 
tions under which visual or auditory 
presentation is superior. Except for 
a discussion of intersensory facilita- 
tion by Harris and a consideration of 
the combined visual and auditory pre- 
sentation of the same material by Day 
and Beach, all of these papers are 
exclusively concerned with the two 
sense modalities operating in isolation. 
While these are necessary and impor- 
tant steps to be taken, the picture can 
by no means be complete without data 
bearing on the question of the relative 
efficiency of the two sensory systems 
when they are simultaneously stim- 
ulated by conflicting material. 

The nearest approach to this prob- 
lem is provided by the classical experi- 
ments on division of attention which 
are reviewed by Woodworth (8). In 
almost every case, the simultaneous 
tasks provided have one sensory and 
one motor component although one or 
two are reported that combine a tac- 
tile and a visual discrimination or two 
simultaneous visual discriminations. 
The orientation of all of this research 
has been toward discovering if success- 
ful division of attention can be said to 
occur. No systematic attempt has 
been made to compare the relative per- 
formances of any two sensory modal- 
ities operating simultaneously. 

Previous research has shown that, 
for single sensory channel operation, 
the type of material utilized for com- 
parative purposes exerts a major influ- 
ence in determining which modality 
will dominate. Pohlmann (7) found 
that the visual sense was superior 
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when nonsense syllables were pre- 
sented but that the auditory mode 
dominated when meaningful material 
was used for testing. Carver (1) 
reported vision better than audition 
for difficult words but the exact oppo- 
site for easy words, and Larsen and 
Feder (5) partially confirmed this 
result when they found difficult but 
meaningful factual material was han- 
dled better visually. However, with 
easier factual material there was no 
difference between the sense modal- 
ities in question. In general, then, if 
the material used in testing is difficult 
or unfamiliar, the most effective man- 
ner of presenting it is probably by way 
of the visual channel. On the other 
hand, easy material may best be han- 
dled through audition, although there 
may well be no difference between the 
senses in this latter case. 


Purpose oF EXPERIMENT 


Since difficulty of material is appar- 
ently an important parameter in com- 
parative studies of sensory channels 
operating independently of each other, 
it may also be an important considera- 
tion when the modalities are operating 
simultaneously. The purpose of this 
experiment, therefore, was to test the 
relative efficiency of vision and audi- 
tion under conditions of simultaneous 
stimulation when “simultaneous” is 
understood to refer to the occurrence 
of a continuous series of events within 
the same span of time. 

By providing perceptual tasks of a 
continuous nature (as opposed to iso- 
lated, momentary perceptions) two 
things were accomplished. First, the 
necessity for and the technical diffi- 
culty involved in temporally equating 
the visual and auditory stimuli were 
in some measure obviated; and, sec- 
ond, a much closer approximation to 
a natural modus operandi was obtained. 
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The task that was selected as most 
adequately conforming to experimen- 
tal demands was one which required 
Ss to detect and report elements ran- 
domly deleted from familiar sequences. 
Series of symbols that were grossly 
overlearned were considered to pro- 
vide the necessary, familiar substratum 
as well as to permit the selection of 
varying levels of difficulty on the basis 
of the amount of overlearning within a 
particular sequence. Since two se- 
quences are already established in most 
literate, English-speaking adults which 
can reasonably be said to be over- 
learned by differing amounts, these 
were utilized for this experiment. The 
sequences in question are English 
alphabets and Arabic numerals. 


MeETHOD 


Materials.—The visual sequences consisted of 
thin white strips of cardboard on which were 
printed English alphabets or Arabic numeral 
sequences from 1 to 20. All sequences had either 
2, 3, 4, 5, or 6 elements missing as determined by 
a random-selection process. Samples of a typ- 
ical alphabet and a typical numeral sequence are 
presented in Fig. 1. Both of these samples have 
three elements deleted. In a given sequence 
(either numeral or letter) any symbol could be 
omitted as determined by chance. The auditory 
stimuli were similar sequences recorded on mag- 
netic tape. 

Procedure.—The Ss were seated in a darkened 
room facing a white square of cardboard to which 
was affixed the visual stimulus. They were told 
that following a ready signal they would be 
shown a numeral sequence (or an alphabet, de- 
pending on the condition being tested) for a brief 
period of time, and that they were to read it 
through once from left to right as rapidly as they 
could and then write down those numerals (or 
letters) which they had detected as missing from 
the sequence. They were never told what the 


123567 8101113 14 151617 181920 


ABCEF GHI JKLNPQRSTUVWXYZ 


Fic. 1. A sample of an alphabet and a num- 
eral sequence. Each sequence has three elements 


deleted. 
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range of possibilities was—whether there would 
be none missing or ten missing or all missing. 
This was the procedure for all nonsimultaneous 
visual trials. Exposure of the stimulus was ac- 
complished by flashing a beam of light on the 
cardboard stimulus strip by means of a hand- 
operated shutter mechanism on a slide projector. 
The exposure time for all visual trials was 6 sec., 
which was barely long enough for the fastest Ss 
to skim through the sequences once. The audi- 
tory trials, similarly, were of 6-sec. duration, and 
all sequences were preceded by a warning note 
recorded on the magnetic tape 3 sec. before the 
beginning of the series. 

Practice trials —The Ss were given practice 
on both the auditory and the visual trials with 
both numerals and alphabets until no trial-to- 
trial improvements were noted in the number of 
detection errors committed. Two or three ex- 
perimental sessions were all that were required 
for Ss to reach a performance plateau, and con- 
trol checks made throughout the course of the 
experiments showed no further improvements. 
All practice trials consisted of the visual and 
auditory conditions being presented separately. 

Nonsimultaneous trials—Following the prac- 
tice trials, Ss were tested on all four nonsimul- 
taneous conditions (i.e., visual numeral, auditory 
numeral, visual alphabet, and auditory alpha- 
bet) to obtain response scores in terms of the 
average number of detection errors per sequence 
as a function of the number of elements missing 
in order to define the basic level of single sensory 
channel operation foreach S. Two, and in some 
cases three, experimental sessions were devoted 
to collecting scores for each of the four stimulus 
conditions just mentioned. One experimental 
session consisted of 50 sequences—10 each with 
2, 3, 4, 5, and 6 elements missing. There was a 
random order of presentation of sequences within 
an experimental session. 

Simultaneous trials —The simultaneous trials 
included all combinations of visual and auditory 
sequences that were possible with the restriction 
of double-channel stimulation imposed—visual 
numeral and auditory numeral, visual numeral 
and auditory alphabet, visual alphabet and audi- 
tory alphabet, and visual alphabet and auditory 
numeral. Each experimental session consisted 
of 50 sequences presented to one modality and 
50 sequences presented to the other. Within 
each experimental session all possible combina- 
tions of numbers of elements missing in one 
sequence were presented with all others an equal 
number of times. That is to say, the auditory 
sequences were composed of ten with two ele- 
ments missing, ten with three elements missing, 
ten with four elements missing, etc.; the visual 
sequences were similarly constructed. The audi- 


tory sequences with two elements missing and 
the visual sequences with two elements missing 
were presented together the same number of 
times as all other combinations. Since there 
were 25 combinations, each was presented twice 
in any one experimental session. When the sim- 
ultaneous presentations consisted of visual num- 
erals and auditory numerals or visual alphabets 
and auditory alphabets, the problem of the coin- 
cidence of deleted elements presented itself. 
Since for any given sequence the missing ele- 
ments were selected randomly, it was possible, 
on a chance basis, for the same symbol to be miss- 
ing in both the auditory and visual sequence. 
For half of the simultaneous trials Ss recorded 
their visual results first and auditory results 
second. On the other half of the trials this pro- 
cedure was reversed. Two experimental sessions 
were devoted to each of the four simultaneous 
conditions stated previously. 

Subjects—Ten enlisted men in the British 
Navy, whose ages were between 17 and 29, 
served as Ss. Most Ss contributed 600 non- 
simultaneous observations, but some contributed 
only 400 since it was found that the experimental 
time could be reduced without seriously affecting 
the precision of the results. All Ss contributed 
400 simultaneous observations on 800 sequences. 


REsULTS 


Types of error—The nature of the 
perceptual task made it possible to 
commit two types of detection errors. 
The first of these consists of the failure 
to report elements which were missing 
from the sequence; these have been 
termed errors of omission. The second 
consists of reports of elements missing 
which were, in fact, present in the 
series; these have been termed errors 
of commission. 

Nonsimultaneous trials —Since the 
data from the simultaneous trials have 
been analyzed with respect to S’s per- 
formance when only one sensory chan- 
nel was being stimulated at a time, the 
first results of importance are those 
relating to this latter condition. Fig- 
ure 2 shows the average number of 
errors per sequence as a function of 
the number of elements missing for all 
nonsimultaneous presentations and 
for all Ss. For greater clarity the 
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Fic. 2. The average number of errors per sequence as a function of the number of elements 
missing for all nonsimultaneous conditions 


numeral series and alphabet series are 
presented on different coordinates. 
The three groups of functions repre- 
sent total errors, errors of omission, 
and errors of commission for both 
visual and auditory sequences. In 
considering the numeral series first, 
what is immediately evident is the 
considerably greater number of total 
errors committed with the auditory 
stimuli. When the total errors are 
broken down into errors of commission 
and errors of omission, it is seen that 
this discrepancy is largely caused by 


the fact that about five times more 
errors of commission were made au- 
rally than were made visually. How- 
ever, the number of errors of omission 
is significantly greater for the auditory 
sequences than for the visual sequences 
when the results are averaged over the 
five conditions of numbers of elements 
missing and subjected to a ¢ test for 
correlated scores (p<.001). Thus, 
more of both types of errors were made 
aurally than visually, but the greatest 
difference between the modalities for 
the nonsimultaneous numeral series 
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was in terms of elements reported as 
missing which were not missing. 

The results from the alphabet series 
show a similar trend with one impor- 
tant exception. The difference be- 
tween the visual and auditory series, 
as regards errors of omission, is not 
statistically significant. What differ- 
ences do exist are entirely caused by 
the greater number of auditory errors 
of commission. So, for both types of 
material, errors of commission were 
made more often with the auditory 
than with the visual stimuli. Signi- 
ficantly more correct detections of 
missing elements were made visually 
with the numeral series, but with the 
alphabet sequences there was no dif- 
ference between the senses. 

Alphabets and numeral sequences 
were chosen, it will be remembered, 
because they were considered to be 
overlearned to different degrees and 
so, it was thought, would provide two 
separate levels of task difficulty. A 
comparison of the error scores for the 
two types of material as pictured in 
Fig. 2 shows that this expectation was 
successfully fulfilled. Roughly two 
times as many errors were made with 
the alphabet sequences as with the 
numerals. 

The results from individual Ss 
looked at separately parallel closely 
the averaged result shown in Fig. 2. 
With the numeral sequences, 7 out of 
10 Ss made better scores visually than 
aurally, 1 exhibited no difference, and 
2 made better scores aurally. With 
the alphabets, 3 Ss performed better 
visually, 5 showed no systematic bias, 
and 2 displayed better performance 
with the auditory material. The 2 
who showed an aural preference for 
the numeral sequences were not the 
same 2 who showed an aural prefer- 
ence for the alphabet sequences. 

Simultaneous trials.—In considering 
the simultaneous performances, the 


important data are not absolute error 
scores on the simultaneous trials but 
rather error scores for the simultane- 
ous trials relative to nonsimultaneous 
operation. Since an increase in errors 
for the simultaneous detections is to 
be expected because of the conflicting 
nature of the material, a useful way of 
considering these data is in terms of 
the average increase in errors (over 
nonsimultaneous performance) as a 
function of the number of elements 
missing. Figure 3 shows the data 
from the simultaneous trials plotted 
in this manner. These curves were 
obtained by subtracting the average 
error scores obtained under nonsimul- 
taneous conditions of presentations 
from the appropriate scores for the 
simultaneous trials. For example, 
when the auditory alphabets were pre- 
sented with the visual numerals (one 
of the four simultaneous conditions 
mentioned previously), the error scores 
for the auditory alphabets on the non- 
simultaneous trials were subtracted 
from the error scores obtained for the 
auditory alphabets. A like procedure 
was used for the visual numerals, and 
the two remainders were plotted on 
the same coordinates. The result is 
the pair of functions shown at the top 
of Fig. 3. Errors of omission and 
errors of commission are plotted 
separately. 

In looking first at the errors of 
omission in Fig. 3, the most clear-cut 
result concerns the manner in which 
the type of material used affected the 
increase in detection errors. Both 
cases that combined a difficult task 
and an easier one displayed significant 
differences in increase of errors with 
the simultaneous material. The inter- 
esting thing is that in both instances 
the easier task was subject to the 
greatest increase in errors. Those 
trials that paired tasks of nearer equal 
difficulty showed no significant differ- 








cE 
L 


OF ERRORS PER SEQUEN 


AVERAGE NUMBER 








SIMULTANEOUS VISION AND AUDITION 





VISUAL NUMERALS 
AND AUDITORY ALPHABET 


AUDITORY | OMISSION 


VISUAL ERRORS OF 
AUDITORY | COMMISSION 


VISUAL } Sarees or 


se co 


gee 


1.OF et Oo 
— 
oO 0 


4S. + ° 














297 


ences in error increases from nonsimul- 
taneous to simultaneous performance. 

The errors of commission presented 
in Fig. 3 provide an interesting anal- 
ysis, likewise. The significant differ- 
ences in the increase of this type of 
error follow the channel stimulated 
rather than the kind of material util- 
ized in the detection task. The great- 
est increase in errors was made in 
every case with the visual material, 
which gave the fewest errors under 
nonsimultaneous conditions. In most 
instances, as can be seen from the neg- 
ative scores in Fig. 3, there was a de- 
crease in auditory errors from nonsim- 
ultaneous to simultaneous operation. 


TABLE 1* 


ANALyYsIS OF VARIANCE OF INCREASED ERRORS 
or Omission Due To SIMULTANEOUS 
STIMULATION 











Source df } 9 F r 
Between materials (M) 1 |20,478.1 |103.00 | >.001 
Between channels (C) 1 23.5 
Between subjects (S) 9 976.5 4.90 019 
Interaction (MXC) 1 351.6 1.76 
Interaction (M XS) 9 $71.1 2.87 
Interaction (CXS) u 612.5 3.07 
Triple interaction 9 199.2 

Total 39 
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Fic. 3. Average increase in errors per se- 
quence as a function of the number of elements 
missing for all simultaneous conditions 





_ ™Scores used for analysis were the average increase 
in errors per sequence X 100. 


Two analyses of variance were per- 
formed on the foregoing results to 
provide a statistical analysis of the 
trends indicated by the graphs. In 
view of the fact that each S was used 
as his own control and a correlation 
factor was involved in the raw score 
used for the analysis, the method of 
extracting variances suggested by 
McNemar (6, p. 306) was utilized. 

Table 1 shows the results from the 
analysis of variance of the errors of 
omission. From this analysis it is 
clear that the most important increase 
in errors due to simultaneous stimula- 
tion is caused by the type of material 
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TABLE 2* 


ANALYsIs OF VARIANCE oF INCREASED Errors 
or Commission Due To SIMULTANEOUS 




















STIMULATION 
Source df =. F ? 

Between materials (M) 1 | 2,433.6 3. 
Between channels (C) 1 | 9,985.6 | 15.5 .006 
Between subjects SS) 9 | 2,790.9 4.3 .035 
Interaction (MXC) 1 62.5 
Interaction (MXS) 9} 1,472.1 2.3 
Interaction (CXS) 9 993.8 1.5 
Triple interaction 

MXCXS) 9 643.8 

otal 39 








* Scores used for analysis we by average increase 
in errors per sequence X 100 + 1 


rather than by the sensory channel. 
The increase in errors was greater when 
the detection task was an easy one 
than when it was more difficult, i.e., 
the efficiency in detecting deleted ele- 
ments deteriorated more under con- 
ditions of simultaneous stimulation 
when the task was an easy one than 
when it was more difficult. On the 
other hand, Table 2, which analyzes 
the variances contributed by the errors 
of commission, shows quite an oppo- 
siteresult. In this case, the only signifi- 
cant difference, except that contributed 
by individual differences, derives from 
the channel stimulated and not from 
the type of material used in the test 
situation. The greatest increase in 
errors from nonsimultaneous to simul- 
taneous stimulation is, in this case, 
contributed by the visual stimuli. 


Discussion 


That there is a basic difference in 
absolute level of performance between 
vision and audition has been shown 
very often. Day and Beach (3), in 
their survey of the literature on this 
point, give strong support to the thesis 
that whichever modality dominates 
depends to a large extent on the type 
of experimental material used; and, 
more explicitly, previous research re- 
sults suggest that vision has an advan- 


tage with difficult material, and audi- 
tion may or may not have an advantage 
with easier material. This research 
has failed to confirm such a finding. 
Figure 2 shows, on the contrary, that 
vision predominated with the easier 
task while the harder one failed to 
show any differences between the 
modalities. Since, however, most of 
the previous research has been con- 
cerned with measuring the retention 
of the presented material over longer 
periods of time than this experiment 
encompasses, the basic difference in 
the nature of the tasks probably makes 
a direct comparison of results invalid. 

The nature of the increase of errors 
of omission from nonsimultaneous to 
simultaneous observations indicates, 
in effect, that less missing elements 
were being detected by the respec- 
tive modalities. In other words, the 
responses per channel were fewer. In 
view of the conflicting nature of the 
stimuli this is not surprising. What 
is surprising, however, is that the in- 
crease in errors should be determined 
solely by the ease or difficulty of the 
perceptual task and not at all by the 
perceptual channelinvolved; and, what 
is more, that the greatest increase in 
errors should be with the material that 
was the least difficult. It may per- 
haps be argued that Ss concentrated 
more on the tasks that were difficult 
to the greater detriment of the easier 
ones. If the introspection of S be 
allowed, however, such an argument 
is considerably weakened in view of 
the fact that all Ss expressed a prefer- 
ence for the visual tasks regardless of 
the type of sequence being used. 
Whether or not an alleged preference 
for one task over another is an expres- 
sion of the relative difficulties of the 
two tasks is, of course, a debatable 
point. It is curious, however, that 
such a preference had no detectable 
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influence on the result of the simul- 
taneous trials. 

Some sort of end-effect might be 
invoked to explain the relative increase 
of errors with the easier material under 
the simultaneous conditions. The 
argument would run thus: Errors of 
omission, which contribute by far the 
most to the effect under discussion, are 
the reverse of successfully detected 
missing elements. If the number of 
successful detections was already very 
low for the difficult material under the 
nonsimultaneous conditions, then de- 
terioration of detection scores, or 
increase in errors of omission, could 
not be relatively very great with the 
simultaneous tasks. Table 3 shows 
that such an hypothesis is extremely 
unlikely. Here are listed the over-all 
percentages of correct detections of 
missing elements for both levels of 
material difficulty and for both sen- 
sory channels. It is evident that the 
percentage of correctly reported letters 
missing from the alphabets could have 
dropped far below the 32% level with- 
out being critically near the zero point. 

A reasonable hypothesis to advance 
in explanation of the large difference 
in errors of commission between visual 
and auditory trials for the nonsimul- 
taneous experiments seems to reside 
in the greater referability of the visual 
stimuli. Although the visual se- 
quences were presented for only a 
limited amount of time, Ss were in no 
way constrained from spending as 


TABLE 3 


Per Cent Correct Detections ror Ten Ss 
AND FOR ALL NONSIMULTANEOUS AND 
StmuLTANEOus ConpITIONS 








Nonsimultaneous Simultaneous 


Modality 





Alphabet |Numerals |Alphabet |Numerals 


51.6 76.0 31.6 43.0 
54.0 71.1 31.8 39.1 





Visual 
Auditory 
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much of the available time as they 
liked on any portion of the sequences 
no matter how small. Although they 
were requested to skim through the 
sequences once, rapidly, there was 
nothing to prevent them from looking 
back and checking up on elements that 
were missing. The fleeting nature of 
the auditory stimulus, on the other 
hand, probably caused a greater 
amount of uncertainty and perhaps 
encouraged guessing. 

This analysis can only be applied to 
the errors of commission made on the 
simultaneous trials if it is somewhat 
modified. Here the absolute number 
of errors made with the auditory mate- 
rial was higher than with the visual 
material, but relative to nonsimul- 
taneous performance there was a large 
increase in visual errors and a small 
but consistent decrease in auditory 
ones. This research provides no direct 
answer to the problem, but it seems 
within the bounds of reason that the 
confidence with and preference for the 
visual task, which may be responsible 
for the relative absence of errors in the 
nonsimultaneous trials, might also 
lead Ss into a false sense of security 
when they are obliged to report on the 
simultaneously presented sequences. 


SUMMARY AND CONCLUSIONS 


The relative performance of vision 
and audition under conditions of sim- 
ultaneous stimulation was investi- 
gated for rapid scanning tasks consid- 
ered to exhibit two levels of difficulty. 
The Ss were required to detect ele- 
ments missing from alphabets and 
numeral sequences when such se- 
quences were presented simultaneously 
to the two senses in one of four com- 
binations—visual alphabet with audi- 
tory alphabet, visual numeral with 
auditory numeral, visual alphabet with 
auditory numeral, or visual numeral 
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with auditory alphabet. Performance 
on the simultaneous trials was com- 
pared with nonsimultaneous perform- 
ance of the same conditions. Analysis 
of performance was made on the basis 
of two types of errors. These were 
errors of omission, or deleted elements 
which were not reported as missing, 
and errors of commission which were 
elements reported as missing which 
were not in fact missing. The follow- 
ing conclusions may be drawn: 

1. For nonsimultaneous operation 
more errors of omission were made 
with the alphabet sequences than with 
the numeral sequences. Significantly 
more auditory than visual errors of 
omission were committed with the 
numeral sequences. With the alpha- 
bet sequences there was no significant 
difference between the visual and 
auditory errors of omission. 

2. For nonsimultaneous operation 
four to five times as many errors of 
commission were made with the audi- 
tory sequences as with the visual 
sequences. 

3. Increase in the number of errors 
of omission from nonsimultaneous to 
simultaneous trials because of the con- 
flicting nature of the material was not 
significantly greater for one modality 
or the other. Significant increases 
that did occur were attributable to the 
type of material used, i.e., numerals or 
alphabets. When a relatively easy 
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task was paired with a relatively diffi- 
cult one, significantly more errors 
were made with the easier task. 

4. Increase in the number of errors 
of commission from nonsimultaneous 
to simultaneous trials was significantly 
greater for the visual trials than for 
the auditory. 


(Received for priority publication 
June 30, 1952) 
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